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PESCRIPTIQN 

DIAGNOSIS AND TREATMENT OF 
AUR-1 and/or AUR-2 RELATED DISORDERS 

Field of the Invention 
The present invention relates to the novel 
protein termed AURORA ONE and AURORA TWO ("AUR-1 and 
AUR-2"), nucleotide sequences encoding AUR-1 and/or AUR- 
2, as well as various products and methods useful for 
the diagnosis and treatment of various AUR-1 and/or AUR- 
2 related diseases and conditions. 

Background of the Invention 

The following description of the background of 
the invention is provided to aid in understanding the 
invention but is not admitted to be prior art to the 
invention. 

Cellular signal transduction is a fundamental 
mechanism whereby external stimuli that regulate diverse 
cellular processes are relayed to the interior of cells. 
One of the key biochemical mechanisms of signal 
transduction involves the reversible phosphorylation of 
proteins, which enables regulation of the activity of 
mature proteins by altering their structure and 
function. 

The best characterized protein kinases in 
eukaryotes phosphorylate proteins on the alcohol moiety 
of serine, threonine and tyrosine residues. These 
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kinases largely fall into two groups, those specific for 
phosphorylating serines and threonines, and those 
specific for phosphorylating tyrosines. Some kinases, 
referred to as "dual specificity" kinases, are able to 
phosphorylate on tyrosine as well as serine/threonine 
residues . 

Protein kinases can also be characterized by 
their location within the cell. Some kinases are 
transmembrane receptor-- type proteins capsible of directly 
altering their catalytic activity in response to the 
external environment such as the binding of a ligand. 
others are non- receptor -type proteins lacking any 
transmembrane domain. They can be found in a variety of 
cellular compartments from the inner surface of the cell 
membrane to the nucleus . 

Many kinases are involved in regulatory 
cascades wherein their substrates may include other 
kinases whose activities are regulated by their 
phosphorylation state. Ultimately the activity of some 
downstream effector is modulated by phosphorylation 
resulting from activation of such a pathway. 

The serine /threonine kinase family includes 
members found at all steps of various signaling 
cascades, including those involved in controlling cell 
growth, migration, differentiation and secretion of 
hormones, phosphorylation of transcription factors 
resulting in altered gene expression, muscle 
contraction, glucose metabolism, control of cellular 
protein synthesis, and regulation of the cell cycle. 



Smnmary ot thg Invention 
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The present invention relates to AUR-l and/or 
AUR-2 polypeptides, nucleic acids encoding such 
polypeptides, cells containing such nucleic acids, 
antibodies to such polypeptides, assays utilizing such 
polypeptides, and methods relating to all of the 
foregoing. The present invention is based upon the 
isolation and characterization of new proteins which we 
have designated AUR-l and/or AUR-2. The polypeptides 
and nucleic acids may be produced using well known and 
standard synthesis techniques when given the sequences 
presented herein. AUR-l and AUR-2 are related 

serine /threonine kinases with short N-terminal 
extensions. The Drosophila and yeast homologs appear to 
be involved in mitotic regulation. The human proteins 
appear to be involved in cancer and/or other signal 
transduction disorders. AURl RNA is broadly expressed 
in rapidly dividing cells, derived from both normal 
tumor tissues. AUR2 RNA however, is expressed in a more 
restricted pattern being low or absent in most normal 
tissues, cuid abundant in only a subset of tumor-derived 
cell lines, particularly those of colorectal origin. 
Both Auroral and Aurora2 show intermediate expression in 
fetal liver, adult testis, euid thymus, suggestive of a 
normal role for these proteins in meiotic division. 
Both AURl cmd AUR2 appear to regulate nuclear division, 
with disruption of their signaling resulting in 
polyploid cells. This phenotype is likely due to 
chromosomal missegregation, as seen with their yeast 
homologue IPLl. Since polyploidy is a hallmark of tumor 
cells and in cells defective in the p53 tumor 
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suppressor, we are testing the role AURl and AUR2 in 
cellular transformation. 

Primary sec[uence analysis of the human genes 
reveals that they contain a highly conserved C- terminal 
protein kinase domain and a weakly conserved N- terminal 
5 domain of 74 to 130 amino acids that may play a 

regulatory role or function as a substrate binding 
motif. The human genes also contain a cAMP/PKA 
phosphorylation site R/KR/KXS/T in the activation loop 
of the kinase domain, which suggests a regulatory 

10 pathway similar to the cell cycle regulated CDC2/CDK- 
related proteins. 

Southern analysis with probes derived from the 
unique N- terminal regions of AURl and AUR2 indicate 
that they exist in as a single copy genes in human 

15 cells- However, under low stringency conditions, we 
were able to detect 1,3 kb and 3,2 kb Sad fragments 
which weakly hybridize to the AURl probe. Cloning and 
sequence analysis reveals this region to encode an 
intronless AURl-related pseudogene (termed AUR3), with 

20 multiple frame shifts. Furthermore, immediately 

upstream of the AUR3 pseudogene is a region with complex 
inverted repeats predicted to form a very stable hairpin 
loop. AUR3 DNA sequence is homologous to AURl beginning 
from the first nucleotide of the AURl cDNA. Immediately 

25 upstream of this site, is the predicted hairpin loop of 
AUR3. We are currently characterizing AURl genomic 
clones to determine if the homology to AUR3 continues 
upstream from this nucleotide, and whether the AURl cDNA 
includes or be preceded by a similar hairpin loop. 
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The utility of the present invention includes 
the ability to screen for inhibitors of cell growth and 
to develop small molecule therapeutics for treating 
cancers . 

Thus, in a first aspect the invention features 
an isolated, enriched, or purified nucleic acid encoding 
a AUR-1 and/or AUR-2 polypeptide. 

By "AUR-1 and/or AOR-2 polypeptide" is meant 
an amino acid sequence substantially similar to the 
sequence shovm in SEQ ID NO: 3 or SEQ ID NO: 4, or 
fragments thereof. A sequence that is substantially 
similar will preferably have at least 90% identity (more 
preferably at least 95% and most preferably 99-100%) to 
the sequence of SEQ ID NO: 3 or SEQ ID N0:4. 

By "identity" is meant a property of sequences 
that measures their similarity or relationship. 
Identity is measured by dividing the number of identical 
residues by the total number of residues and multiplying 
the product by 100. Thus, two copies of exactly the 
same sequence have 100% identity, but sequences that are 
less highly conserved and have deletions, additions, or 
replacements may have a lower degree of identity. Those 
skilled in the art will recognize that several computer 
programs are available for determining sequence 
identity. 

By "isolated" in reference to nucleic acid is 
meant a polymer of 6 (preferably 21, more preferably 39, 
most preferably 75) or more nucleotides conjugated to 
each other, including DNA or RNA that is isolated from a 
natural source or that is synthesized. In certain 
embodiments of the invention longer nucleic acids are 
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preferred, for example those of 300, 600, 900 or more 
nucleotides and/or those having at least 50%, 60%, 75% , 
90%, 95% or 99% identity to the full length sequence 
shown in SEQ ID NO:l or SEQ ID NO:2. The isolated 
nucleic acid of the present invention is unique in the 
5 sense that it is not found in a pure or separated state 
in nature. Use of the term "isolated" indicates that a 
naturally occurring sequence has been removed from its 
normal cellular f i-e, . chromosomal) environment* Thus, 
the sequence may be in a cell -free solution or placed in 
10 a different cellular environment. The term does not 
imply that the sequence is the only nucleotide chain 
present, but that it is essentially free (about 90 - 95% 
pure at least) of non- nucleotide material naturally 
associated with it and thus is meant to distinguished 
15 from isolated chromosomes. 

By the use of the term "enriched" in reference 
to nucleic acid is meant that the specific DNA or RNA 
sequence constitutes a significantly higher fraction (2 
- 5 fold) of the total DNA or RNA present in the cells 
20 or solution of interest than in normal or diseased cells 
or in the cells from which the sequence was taken. This 
could be caused by a person by preferential reduction in 
the amount of other DNA or RNA present, or by a 
preferential increase in the amount of the specific DNA 
25 or RNA sequence, or by a combination of the two. 

However, it should be noted that enriched does not imply 
that there are no other DNA or RNA sequences present, 
just that the relative amount of the sequence of 
interest has been significantly increased. The term 
30 significant here is used to indicate that the level of 
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increase is useful to the person making such an 
increase, and generally means an increase relative to 
other nucleic acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term 
also does not imply that there is no DNA or RNA from 
other sources. The other source DNA may, for example, 
comprise DNA from a yeast or bacterial genome^ or a 
cloning vector such as pUC19. This term distinguishes 
from naturally occurring events, such as viral 
infection, or tumor type growths, in which the level of 
one mRNA may be naturally increased relative to other 
species of mRNA. That is, the term is meant to cover 
only those situations in which a person has intervened 
to elevate the proportion of the desired nucleic acid. 

It is also advantageous for some purposes that 
a nucleotide sequence be in purified form. The term 
"purified" in reference to nucleic acid does not require 
absolute purity (such as a homogeneous preparation) ; 
instead, it represents an indication that the sequence 
is relatively purer than in the natural environment 
(compared to the natural level this level should be at 
least 2-5 fold greater, e.g., in terms of mg/ml) . 
Individual clones isolated from a cDNA library may be 
purified to electrophoretic homogeneity. The claimed 
DNA molecules obtained from these clones could be 
obtained directly from total DNA or from total RNA. The 
cDNA clones are not naturally occurring, but rather are 
preferably obtained via manipulation of a partially 
purified naturally occurring substance (messenger RNA) . 
The construction of a cDNA library from mRNA involves 
the creation of a synthetic substance (cDNA) and pure 
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individual cDNA clones can be isolated from the 
synthetic library by clonal selection of the cells 
carrying the cDNA library. Thus, the process which 
includes the construction of a cDNA library from mRNA 
and isolation of distinct cDNA clones yields an 
5 approximately 10^- fold purification of the native 

message. Thus, purification of at least one order of 
magnitude, preferably two or three orders, and more 
preferably four or five orders of magnitude is expressly 
contemplated . 

j^O By "a AUR-1 and/or AUR-2 polypeptide" is meant 

25 (preferably 30, more preferably 35, most preferably 
40) or more contiguous amino acids set forth in the full 
length amino acid sequence of SEQ ID NO: 3 or SEQ ID 
NO: 4, or a functional derivative thereof as described 
15 herein. In certain aspects, polypeptides of 100, 200, 

3 00 or more amino acids are preferred. The AUR-1 and/or 
AUR-2 polypeptide can be encoded by a full-length 
nucleic acid sequence or any portion of the full-length 
nucleic acid sequence, so long as a functional activity 
20 of the polypeptide is retained. 

In preferred embodiments the isolated nucleic 
acid comprises, consists essentially of, or consists of 
a nucleic acid sequence set forth in the full length 
amino acid sequence of SEQ ID NO: 3 or SEQ ID NO: 4, a 
25 functional derivative thereof, or encodes at least 25, 
30, 35, 40, 50, 100, 200, or 300 contiguous amino acids 
thereof; the AUR-1 and/or AUR-2 polypeptide comprises, 
consists essentially of, or consists of at least 25, 30, 
35, or 40 contiguous amino acids of a AUR-1 and/or AUR-2 
30 polypeptide. The nucleic acid may be isolated from a 
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natural source by cDNA cloning or subtractive 
hybridization; the natural source may be mammalian 
(human) blood, semen, or tissue and the nucleic acid ma] 
be synthesized by the triester method or by using an 
automated DNA synthesizer. In yet other preferred 
embodiments the nucleic acid is a conserved or unique 
region, for example those useful for the design of 
hybridization probes to facilitate identification and 
cloning of additional polypeptides, the design of PGR 
probes to facilitate cloning of additional polypeptides, 
and obtaining antibodies to polypeptide regions. 
Examples of amino acid sequences of the present 
invention include the following amino acid sequences 
(the isolated, purified or enriched nucleic acids 
encoding them are also within the scope of the present 
invention) : 

ENSYPWPYGRQ (SEQ ID NO: 5) 
CISGP (SEQ ID NO:G) 
QFPO (SEQ ID NO: 7) 
VNSGQ (SEQ ID NO: 8) 
RKEPVTPSA-LV (SEQ ID NO: 9) 
LMSRSNVQPTAAP (SEQ ID NO: 10) 
VQNQKQKQLQATSVPH (SEQ ID NO: 11) 
PVSRPLNNTQK (SEQ ID NO: 12) 
VMENSSGTPD (SEQ ID NO: 13) 
ILTRHFTID (SEQ ID NO: 14) 
SKQPLPSAPENNPEEQIASKQK (SEQ ID NO: 15) 

By "conserved nucleic acid regions", are meant 
regions present on two or more nucleic acids encoding a 
AUR-l and/or AUR-2 polypeptide, to which a particular 
nucleic acid sequence can hybridize under lower 
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stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acid 
encoding AUR-1 and/or AUR-2 polypeptides are provided in 
Abe, et al. .t, RiqI . Chem. , 12:13361 (1992) (hereby 
incorporated by reference herein in its entirety, 
5 including any drawings) . Preferably, conserved regions 
differ by no more than 5 out of 20 nucleotides. 

By "unique nucleic acid region" is meant a 
sequence present in a full length nucleic acid coding 
for a AUR-1 and/or AUR-2 polypeptide that is not present 

10 in a sequence coding for any other naturally occurring 
polypeptide. Such regions preferably comprise 30 or 45 
contiguous nucleotides present in the full length 
nucleic acid encoding a AUR-1 and/or AUR-'2 polypeptide. 
In particular, a unique nucleic acid region is 

15 preferably of mammalian origin. 

The invention also features a nucleic acid 
probe for the detection of a AUR-1 and/or AUR-2 
polypeptide or nucleic acid encoding a AUR-1 and/or AUR- 
2 polypeptide in a sample. The nucleic acid probe 

20 contains nucleic acid that will hybridize to a sequence 
set forth in SEQ ID NO:l or SEQ ID NO: 2 or a functional 
derivative thereof. 

In preferred embodiments the nucleic acid 
probe hybridizes to nucleic acid encoding at least 12, 

25 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous 

amino acids of the full-length sequence set forth in SEQ 
ID NO:l or SEQ ID NO: 2 or a functional derivative 
thereof. Various low or high stringency hybridization 
conditions may be used depending upon the specificity 

30 and selectivity desired. Under stringent hybridization 
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conditions only highly complementary nucleic acid 
sequences hybridize. Preferably, such conditions 
prevent hybridization of nucleic acids having i or 2 
mismatches out of 20 contiguous nucleotides. 

Methods for using the probes include detecting 
the presence or amount of ADR-l and/or AUR- 2 RNA in a 
sample by contacting the sample with a nucleic acid 
probe under conditions such that hybridization occurs 
and detecting the presence or amount of the probe bound 
to AUR-l and/or AUR-2 RNA. The nucleic acid duplex 
formed between the probe and a nucleic acid sequence 
coding for a ADR-i and/or AOR-2 polypeptide may be used 
in the identification of the sequence of the nucleic 
acid detected (for exanple see. Nelson et al., in 
WQniSQt.Opic DNft Probe TfiChniqiieP , p. 275 Academic Press, 
San Diego (Kricka, ed., 1992) hereby incorporated by 
reference herein in its entirety, including any 
drawings) . Kits for performing such methods may be 
constructed to include a container means having disposed 
therein a nucleic acid probe. 

The invention also features recombinant 
nucleic acid, preferably in a cell or an organism. The 
recombinant nucleic acid may contain a sequence set 
forth in SEQ ID N0:1 or SEQ ID NO: 2 or a functional 
derivative thereof and a vector or a promoter effective 
to initiate transcription in a host cell. The 
recombinant nucleic acid can alternatively contain a 
transcriptional initiation region functional in a cell, 
a sequence complimentary to an RNA sequence encoding a 
AUR-l and/or AUR-2 polypeptide and a transcriptional 
termination region functional in a cell. 
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In another aspect the invention features an 
isolated, enriched, or purified AUR-l and/or AUR-2 
polypeptide . 

By "isolated" in reference to a polypeptide is 
meant a polymer of 2 (preferably 7, more preferably 13. 
most preferably 25) or more amino acids conjugated to 
each other, including polypeptides that are isolated 
from a natural source or that are synthesized. In 
certain aspects longer polypeptides are preferred, such 
as those with 402, 407, 413, or 425 contiguous amino 
acids set forth in SEQ ID NO: 3 or SEQ ID NO: 4. The 
isolated polypeptides of the present invention are 
unique in the sense that they are not found in a pure or 
separated state in nature- Use of the term "isolated" 
indicates that a naturally occurring sequence has been 
removed from its normal cellular environment. Thus, the 
sequence may be in a cell-free solution or placed in a 
different cellular environment. The term does not imply 
that the sequence is the only amino acid chain present, 
but that it is essentially free (about 90 - 95% pure at 
least) of non-amino acid material naturally associated 
with it. 

By the use of the term "enriched" in reference 
to a polypeptide is meant that the specific amino acid 
sequence constitutes a significantly higher fraction (2 
- 5 fold) of the total of amino acids present in the 
cells or solution of interest than in normal or diseased 
cells or in the cells from which the sequence was taken. 
This could be caused by a person by preferential 
reduction in the amount of other amino acids present, or 
30 by a preferential increase in the amount of the specific 



20 



25 
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amino acid sequence of interest, or by a combination of 
the two. However, it should be noted that enriched does 
not imply that there are no other amino acid sequences 
present, just that the relative amount of the sequence 
of interest has been significantly increased. The term 
significant here is used to indicate that the level of 
increase is useful to the person making such an 
increase, and generally means an increase relative to 
other amino acids of about at least 2 fold, more 
preferably at least 5 to 10 fold or even more. The term 
also does not imply that there is no amino acid from 
other sources. The other source amino acid may, for 
example, comprise amino acid encoded by a yeast or 
bacterial genome, or a cloning vector such as pUC19. 
The term is mesuit to cover only those situations in 
which man has intervened to elevate the proportion of 
the desired nucleic acid* 

It is also advantageous for some purposes that 
an amino acid sequence be in purified form. The term 
"purified" in reference to a polypeptide does not 
require absolute purity (such as a homogeneous 
preparation) ; instead, it represents an indication that 
the sequence is relatively purer than in the natural 
environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of 
mg/ml) . Purification of at least one order of 
magnitude, preferably two or three orders, and more 
preferably four or five orders of magnitude is expressly 
contemplated. The substance is preferably free of 
contamination at a functionally significant level, for 
example 90%, 95%, or 99% pure. 
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In preferred embodiments the AUR-1 and/or AUR- 
2 polypeptide contains at least 25, 30, 35, 40, 50^ 100, 
150, 200, 250, 300/ or 350 contiguous amino acids of the 
full-length sequence set forth in SEQ ID NO: 3 or SEQ ID 
NO: 4/ or a functional derivative thereof. 
5 In yet another aspect the invention features 

an antibody ( e.g. . a monoclonal or polyclonal antibody) 
having specific binding affinity to a AUR-1 and/or AUR-2 
polypeptide- The antibody contains a sequence of amino 
acids that is able to specifically bind to a AUR-l 

10 and/or AUR-2 polypeptide. By "specific binding 

affinity" is meant that the antibody binds to AUR-1 
and/or AUR-2 polypeptides with greater affinity than it 
binds to other polypeptides under specified conditions. 

Antibodies having specific binding affinity to 

15 a AUR-1 and/or AUR-2 polypeptide may be used in methods 
for detecting the presence and/or amount of a AUR-1 
and/or AUR-2 polypeptide in a sample by contacting the 
sample with the antibody under conditions such that an 
immunocomplex forms and detecting the presence and/or 

20 amount of the antibody conjugated to the AUR-1 and/or 

AUR-2 polypeptide. Diagnostic kits for performing such 
methods may be constructed to include a first container 
means containing the antibody and a second container 
means having a conjugate of a binding partner of the 

25 antibody and a label. 

In another aspect the invention features a 
hybridoma which produces an antibody having specific 
binding affinity to a AUR-1 and/or AUR-2 polypeptide. 
By "hybridoma" is meant an immortalized cell line which 

30 is capable of secreting an antibody, for example a AUR-1 
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and/or AUR-2 antibody. In preferred embodiments the 
AUR-l and/or AUR-2 antibody comprises a sequence of 
amino acids that is able to specifically bind a AUR-l 
and/or AUR-2 polypeptide. 

In another aspect, the invention describes a 
polypeptide comprising a recombincuit AUR-l and/or AUR-2 
polypeptide or a unique fragment thereof. By "unique 
fragment," is meant an amino acid sequence present in a 
full-length AUR-l and/or AUR-2 polypeptide that is not 
present in any other naturally occurring polypeptide. 
Preferably, such a sequence comprises 6 contiguous amine 
acids present in the full sequence. More preferably, 
such a sequence con^irises 12 contiguous amino acids 
present in the full sequence. Even more preferably, 
such a sequence comprises 18 contiguous amino acids 
present in the full sequence. 

By "recombinant AUR-l and/or AUR-2 
polypeptide" is meant to include a polypeptide produced 
by recombinant DNA techniques such that it is distinct 
from a naturally occurring polypeptide either in its 
location <e.g. , present in a different cell or tissue 
than found in nature) , purity or structure . Generally, 
such a recombinant polypeptide will be present in a cell 
in an amount different from that normally observed in 
nature . 

In another aspect, the invention describes a 
recombinant cell or tissue containing a purified nucleic 
acid coding for a AUR-l and/or AUR-2 polypeptide. in 
such cells, the nucleic acid may be under the control of 
its genomic regulatory elements, or may be under the 
control of exogenous regulatory elements including an 
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exogenous promoter. By "exogenous" it is meant a 
promoter that is not normally coupled in vivo 
trsmscriptionally to the coding sequence for the AUR-1 
and/or AUR-2 polypeptide. 

In another aspect, the invention features a 
5 AUR-1 and/or AUR-2 polypeptide binding agent able to 

bind to a AUR-1 and/or AUR-2 polypeptide. The binding 
agent is preferably a purified antibody which recognizes 
an epitope present on a AUR-1 and/or AUR~2 polypeptide. 
Other binding agents include molecules which bind to the 
10 AUR-1 and/or AUR-2 polypeptide and analogous molecules 
which bind to a AUR-1 and/or AUR-2 polypeptide. Such 
binding agents may be identified by using assays that 
measure AUR-1 and/or AUR-2 binding partner activity, 
such as those that measure PDGFR activity. 
25 By "purified" in reference to an antibody is 

meant that the antibody is distinct from naturally 
occurring antibody, such as in a purified form. 
Preferably, the antibody is provided as a homogeneous 
preparation by standard techniques. Uses of antibodies 
20 to the cloned polypeptide include those to be used as 
therapeutics, or as diagnostic tools- 

The invention features a method for screening 
for human cells containing a AUR-1 and/or AUR-2 
polypeptide or an equivalent sequence. The method 
25 involves identifying the novel polypeptide in human 

cells using techniques that are routine and standard in 
the art, such as those described herein for identifying 
AUR-1 and/or AUR-2 f e.g. , cloning. Southern or Northern 
blot analysis, Hk situ hybridization, PCR amplification, 
30 etc . ) . 
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The invention also features methods of 
screening human cells for binding partners of AUR-1 
and/or AUR-2 polypeptides and screening other organisms 
for ADR-1 and/or ADR- 2 or the corresponding binding 
partner. The present invention also features the 
purified, isolated or enriched versions of the peptides 
identified by the methods described above. 

In another aspect, the invention provides an 
assay to identify agents capable of interfering with the 
interaction between AUR-l and/or AUR-2 and a AUR-l 
and/or AUR-2 binding partner. Such assays may be 
performed in vitro or in vivo and are described in 
detail herein or Ccm be obtained by modifying existing 
assays, such as the growth assay described in Serial No. 
08/487,088, filed June 7, 1995, (incorporated herein by 
reference including any drawings) or the assays 
described in Serial No. 60/005,167, filed October 13, 
1995 (incorporated herein by reference including any 
drawings) . Another assay which could be modified to use 
the genes of the present invention are described in 
International Application No. WO 94/23039, published 
October 13, 1994. Other possibilities include detecting 
kinase activity in an autophosphoirylation assay or 
testing for kinase activity on standard substrates such 
as histones, myelin basic protein, gamma tubulin, or 
centrosomal proteins. Binding partners may be 
identified by putting the N- terminal portion of the 
protein into a two-hybrid screen or detecting 
phosphotyrosine of a dual specificity kinase. Fields 
and Song, U.S. Patent No. 5,283,173, issued February 1, 
1994 and is incorporated be reference herein. 
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One means by which inhibitors of Aurora 
activity may be defined is a screening system using a 
temperature-sensitive yeast mutant as described by Chan 
and Botstein (Genetics 135:677-691, 1993); see also 
Francisco et al . , Mol. Cell. Bio, 14 (7) :4731-4740 ^ 1994) 
5 both of which are hereby incorporated herein by 

reference in their entirety including any drawings. 
Briefly, yeast strain CCY72-3D-1 (ipl 1-2), which 
expresses a temperature sensitive form of the yeast 
homologue of Aurora (ipll) while viable at 26'C is 

10 incapable of growth at 37**C. Transfection of this 

strain with an expression plasmid containing a hybrid 
Aurora gene consisting of the N-terminal portion of 
ipll, containing the putative substrate interaction 
domain(s)), and the C- terminal portion of Auroras 1 or 

15 2, containing the catalytic domain, overcomes this 
sensitivity to growth temperature. The Aurora- 
expressing yeast strain is then grown at 37'*C in the 
presence of a test substance . No growth will be evident 
in the present of substances that inhibit Aurora 

20 catalytic function. Potential inhibitors include small 
molecular weight chemicals and/or natural products 
isolation from diverse organisms such as fungi, marine 
organisms, plants, etc, 

The summary of the invention described above 

25 is non- limiting and other features and advantages of the 
invention will be apparent from the following descrip- 
tion of the preferred embodiments, and from the claims. 

Peseription of the Preferred Erobodlments 
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The present invention relates to AUR-1 and/or 
AUR-2 polypeptides, nucleic acids encoding such 
polypeptides, cells, tissues and animals containing such 
nucleic acids, antibodies to such polypeptides, assays 
utilizing such polypeptides, and methods relating to all 
of the foregoing. 

I. Nucleic Acid Encoding AUR-1 and/or AUR-2 
Polypeptides • 

Included within the scope of this invention 
are the functional equivalents of the herein-described 
isolated nucleic acid molecules. The degeneracy of the 
genetic code permits substitution of certain codons by 
other codons which specify the same amino acid and hence 
would give rise to the same protein. The nucleic acid 
sequence can vary substantially since, with the 
exception of methionine and tryptophan, the known amino 
acids can be coded for by more than one codon. Thus, 
portions or all of the AUR-1 and/or AUR-2 gene could be 
synthesized to give a nucleic acid sequence 
significantly different from that shown in SEQ ID NO:l 
or SEQ ID NO: 2. The encoded amino acid sequence thereof 
would, however, be preserved. 

In addition, the nucleic acid sequence may 
comprise a nucleotide sequence which results from the 
addition, deletion or substitution of at least one 
nucleotide to the 5 » -end and/or the 3 ' -end of the 
nucleic acid formula shown in SEQ ID N0:1 or SEQ ID NO: 2 
or a derivative thereof. Any nucleotide or 
polynucleotide may be used in this regard, provided that 
its addition, deletion or substitution does not alter 
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the amino acid sequence of SEQ ID NO; 3 or SEQ ID NO: 4 
which is encoded by the nucleotide sequence. For 
example, the present invention is intended to include 
any nucleic acid sequence resulting from the addition of 
ATG as an initiation codon at the 5 '-end of the 
5 inventive nucleic acid sequence or its derivative, or 
from the addition of TTA, TAG or TGA as a termination 
codon at the 3 ' -end of the inventive nucleotide sequence 
or its derivative. Moreover, the nucleic acid molecule 
of the present invention may, as necessary, have 

10 restriction endonuclease recognition sites added to its 
5 '--end cmd/or 3 '-end- 
Such functional alterations of a given nucleic 
acid sequence afford an opportunity to promote secretion 
and/or processing of heterologous proteins encoded by 

15 foreign nucleic acid sequences fused thereto. All 
variations of the nucleotide sequence of the AUR-1 
and/or AUR-2 genes and fragments thereof permitted by 
the genetic code are, therefore, included in this 
invention. 

20 Further, it is possible to delete codons or to 

substitute one or more codons by codons other than 
degenerate codons to produce a structurally modified 
polypeptide, but one which has substantially the same 
utility or activity of the polypeptide produced by the 

25 unmodified nucleic acid molecule. As recognized in the 
art, the two polypeptides are functionally equivalent, 
as are the two nucleic acid molecules which give rise to 
their production, even though the differences between 
the nucleic acid molecules are not related to degeneracy 

30 of the genetic code. 
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II. A Nucl ic Acid Prob for the Detection of AUR-1 
and/or AUR-2. 

A nucleic acid probe of the present invention 
may be used to probe an appropriate chromosomal or cDNA 
library by usual hybridization methods to obtain another 
nucleic acid molecule of the present invention. A 
chromosomal DNA or cDNA library may be prepared from 
appropriate cells according to recognized methods in the 
art (cf . "Molecular Cloning: A Laboratory Manual", 
second edition, edited by Sambrook, Fritsch, & Maniatis, 
Cold Spring Harbor Laboratory, 1989) . 

In the alternative, chemical synthesis is 
carried out in order to obtain nucleic acid probes 
having nucleotide sequences which correspond to N- 
terminal and C- terminal portions of the amino acid 
sequence of the polypeptide of interest. Thus, the 
synthesized nucleic acid probes may be used as primers 
in a polymerase chain reaction (PCR) carried out in 
accordance with recognized PCR techniques, essentially 
according to PCR Protocols, "A Guide to Methods and 
Applications", edited by Michael et al.. Academic Press, 
1990, utilizing the appropriate chromosomal or cDNA 
library to obtain the fragment of the present invention. 

One skilled in the art can readily design such 
probes based on the sequence disclosed herein using 
methods of computer alignment and sequence analysis 
known in the art (cf . "Molecular Cloning: A Laboratory 
Manual", second edition, edited by Sambrook, Fritsch, & 
Maniatis, Cold Spring Harbor Laboratory, 1989) . The 
hybridization probes of the present invention can be 
labeled by standard labeling techniques such as with a 
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radiolabel, enzyme label, fluorescent label, biotin- 
avidin label, chemiluminescence, and the like. After 
hybridization, the probes may be visualized using known 
methods . 

The nucleic acid probes of the present 
5 invention include RNA, as well as DNA probes, such 

probes being generated using techniques known in the 
art . The nucleic acid probe may be immobilized on a 
solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, 

10 complex carbohydrates such as agarose and sepharose, and 
acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such 
solid supports are well known in the art. 

The test samples suitable for nucleic acid 

15 probing methods of the present invention include, for 
example, cells or nucleic acid extracts of cells, or 
biological fluids. The sample used in the above- 
described methods will vary based on the assay format, 
the detection method and the nature of the tissues, 

20 cells or extracts to be assayed. Methods for preparing 
nucleic acid extracts of cells are well known in the art 
and can be readily adapted in order to obtain a sample 
which is compatible with the method utilized. 



25 III. A Probe Based Method And Kit For 
Detecting AUR-l and/or AUR-2. 

One method of detecting the presence of AUR-1 
and/or AUR-2 in a sample comprises (a) contacting said 
sample with the above -de scribed nucleic acid probe, 
30 under conditions such that hybridization occurs, and (b) 
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detecting the presence of said probe bound to said 
nucleic acid molecule. One skilled in the art would 
select the nucleic acid probe according to techniques 
known in the art as described above. Samples to be 
tested include but should not be limited to RNA samples 
of human tissue. 

A kit for detecting the presence of AUR-1 
and/or AUR-2 in a sample comprises at least one 
container means having disposed therein the above - 
described nucleic acid probe. The kit may further 
comprise other containers comprising one or more of the 
following: wash reagents and reagents capable of 
detecting the presence of bound nucleic acid probe. 
Examples of detection reagents include, but are not 
limited to radiolabelled probes, enzymatic labeled 
probes (horseradish peroxidase, alkaline phosphatase) , 
and affinity labeled probes (biotin, avidin, or 
steptavidin) . 

In detail, a compartmentalized kit includes 
any kit in which reagents are contained in separate 
containers. Such containers include small glass 
containers, plastic containers or strips of plastic or 
paper. Such containers allow the efficient transfer of 
reagents from one compartment to another compartment 
such that the samples and reagents are not cross - 
contaminated and the agents or solutions of each 
container can be added in a quantitative fashion from 
one compartment to another. Such containers will 
include a container which will accept the test sample, a 
container which contains the probe or primers used in 
the assay, containers which contain wash reagents (such 
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as phosphate buffered saline, Tris-buf f ers , and the 
like) , and containers which contain the reagents used to 
detect the hybridized probe, bound antibody, amplified 
product, or the like. One skilled in the art will 
readily recognize that the nucleic acid probes described 
5 in the present invention can readily be incorporated 

into one of the established kit formats which are well 
known in the art. 

IV. DNA Constructs Con^rising a AUR-l and/ or AUR-2 
10 Nucleic Acid Molecule and Cells Containing These 
Constructs. 

The present invention also relates to a 
recombinant DNA molecule comprising, 5' to 3', a 
promoter effective to initiate transcription in a host 

15 cell and the above -described nucleic acid molecules. In 
addition, the present invention relates to a recombinant 
DNA molecule comprising a vector and an above -de scribed 
nucleic acid molecules. The present invention also 
relates to a nucleic acid molecule comprising a 

20 transcriptional region functional in a cell, a sequence 
conplimentary to an RNA sequence encoding an amino acid 
sequence corresponding to the above -described 
polypeptide, and a transcriptional termination region 
functional in said cell. The above -de scribed molecules 

25 may be isolated and/or purified DNA molecules. 

The present invention also relates to a cell 
or organism that contains an above -described nucleic 
acid molecule and thereby is capable of expressing a 
peptide. The polypeptide may be purified from cells 

30 which have been altered to express the polypeptide. A 
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cell is said to be "altered to express a desired 
polypeptide" when the cell, through genetic 
manipulation, is made to produce a protein which it 
normally does not produce or which the cell normally 
produces at lower levels. One skilled in the art can 
readily adapt procedures for introducing and expressing 
either genomic, cDNA, or synthetic sequences into either 
eukaryotic or prokaryotic cells. 

A nucleic acid molecule, such as DNA, is said 
to be "capable of expressing" a polypeptide if it 
contains nucleotide sequences which contain 
transcriptional and translational regulatory information 
and such sequences are "operably linked" to nucleotide 
sequences which encode the polypeptide. An operable 
linkage is a linkage in which the regulatory DNA 
sequences and the DNA sequence sought to be expressed 
are connected in such a way as to permit gene sequence 
expression. The precise nature of the regulatory 
regions needed for gene sequence expression may vary 
from organism to organism, but shall in general include 
a promoter region which, in prokaryotes, contains both 
the promoter (which directs the initiation of RNA 
transcription) as well as the DNA sequences which, when 
transcribed into RNA, will signal synthesis initiation. 
Such regions will normally include those 5 • -non-coding 
sequences involved with initiation of transcription and 
translation, such as the TATA box, capping sequence, 
CAAT sequence, and the like. 

If desired, the non-coding region 3' to the 
sequence encoding an AUR-1 and/or AUR-2 gene may be 
obtained by the above -described methods. This region 
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may be retained for its transcriptional termination 
regulatory sequences, such as termination and 
polyadenylation . Thus, by retaining the 3 • -region 
naturally contiguous to the DNA secpaence encoding an 
AUR-1 and/or ADR-2 gene, the transcriptional termination 
signals may be provided. Where the transcriptional 
termination signals are not satisfactorily functional in 
the expression host cell, then a 3* region functional in 
the host cell may be substituted. 

Two DNA sequences (such as a promoter region 
sequence and an ADR-1 and/or AXJR-2 sequence) are said to 
be operably linked if the nature of the linkage between 
the two DNA sequences does not (1) result in the 
introduction of a frame-shift mutation, (2) interfere 
with the ability of the promoter region sequence to 
direct the transcription of an AUR-1 and/ or AUR-2 gene 
sequence, or (3) interfere with the ability of the an 
AUR-1 and/or AUR-2 gene sequence to be transcribed by 
the promoter region sequence. Thus, a promoter region 
would be operably linked to a DNA sequence if the 
promoter were capable of effecting transcription of that 
DNA sequence. Thus, to express an AUR-1 and/or AUR-2 
gene, transcriptional and translational signals 
recognized by an appropriate host are necessary. 

The present invention encompasses the 
expression of the AUR-1 and/or AUR-2 gene (or a 
functional derivative thereof) in either prokaryotic or 
eukaryotic cells. Prokaryotic hosts are, generally, 
very efficient and convenient for the production of 
recombinant proteins and are, therefore, one type of 
preferred expression system for the AUR-1 and/or AUR-2 
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gene. Prokaryotes most frequently are represented by 
various strains of E. coli . However, other microbial 
strains may also be used, including other bacterial 
strains . 

In prokaryotic systems, plasmid vectors that 
contain replication sites and control sequences derived 
from a species conqoatible with the host may be used. 
Examples of suitable plasmid vectors may include pBR322, 
pUClie, PUC119 and the like; suitable phage or 
bacteriophage vectors may include ygtlO, ygtll and the 
like; and suitable virus vectors may include pMAM-neo, 
pKRC and the like. Preferably, the selected vector of 
the present invention has the capacity to replicate in 
the selected host cell. 

Recognized prokaryotic hosts include bacteria 
such as E. coil. Bacillus, Streptomyces, Pseudomonas, 
Salmonella, Serratia, and the like. However, under such 
conditions, the peptide will not be glycosylated. The 
prokaryotic host must be compatible with the replicon 
and control sequences in the expression plasmid. 

To express AUR-i and/or AUR-2 (or a functional 
derivative thereof) in a prokaryotic cell, it is 
necessary to operably link the AUR-i and/or AUR-2 
sequence to a functional prokaryotic promoter. Such 
promoters may be either constitutive or, more 
preferably, regulatable (i.e., inducible or 
derepressible) . Examples of constitutive promoters 
include the int promoter of bacteriophage X, the bla 
promoter of the p-lactamase gene sequence of pBR322, and 
the CAT promoter of the chloramphenicol acetyl 
transferase gene sequence of pPR325, and the like. 
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Examples of inducible prokaryotic promoters include the 
major right and left promoters of bacteriophage X (Pj, and 
Pg), the trp, recA, XacZ, XacI, and gal promoters of E. 
coli, the a-amylase (Ulmanen et at., J . Bacterlol t 
152:176-182(1985)) and the g-28-specif ic promoters of B. 
subtilis (Oilman et al . , Gene sequence 32:11-20(1984)), 
the promoters of the bacteriophages of Bacillus 
(Gryczan, In: The Molecular Biology of the Bacilli, 
Academic Press, Inc., NY (1982)), and Streptomyces 
promoters (Ward et at., MoT fiftn. ggnet. 203:468- 
478(1986)) . Prokaryotic promoters are reviewed by Click 
( ^T , Tr.fl M-i^r-oh-iot. 1:277-282(1987)); Cenatiempo 
f n-io^hiinie 58:505-516(1986)); and Gottesman ( ftnn. Rev. 
nt^net. Ig: 415-442 (1984)). 

Proper expression in a prokaryotic cell also 
15 requires the presence of a ribosome binding site 

upstream of the gene sequence -encoding sequence. Such 
ribosome binding sites are disclosed, for example, by 
Gold et at. fftnn ^""^ Mirrobiol. 35:365-404(1981)). 
The selection of control sequences, expression vectors, 
transformation methods, and the like, are dependent on 
the type of host cell used to express the gene. As used 
herein, "cell", "cell line", and "cell culture" may be 
used interchangeably and all such designations include 
progeny. Thus, the words "transformants" or 
"transformed cells" include the primary subject cell and 
cultures derived therefrom, without regard to the number 
of transfers. It is also understood that all progeny 
may not be precisely identical in DNA content, due to 
deliberate or inadvertent mutations. However, as 
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defined, mutant progeny have the same functionality as 
that of the originally transformed cell. 

Host cells which may be used in the expression 
systems of the present invention are not strictly 
limited, provided that they are suitable for use in the 
expression of the ADR-l and/or AUR-2 peptide of 
interest. Suitable hosts may often include eukaryotic 
cells. Preferred eukaryotic hosts include, for example, 
yeast, fungi, insect cells, mammalian cells either in 
vivo, or in tissue culture. Mammalian cells which may 
be useful as hosts include HeLa cells, cells of 
fibroblast origin such as VERO or CHO-Kl, or cells of 
lymphoid origin and their derivatives. Preferred 
mammalian host cells include SP2/0 and J558L, as well as 
neuroblastoma cell lines such as IMR 332 which may 
provide better capacities for correct post-translational 
processing. 

In addition, plant cells are also available as 
hosts, and control sequences compatible with plant cells 
are available, such as the cauliflower mosaic virus 35S 
and 19S, and nopaline synthase promoter and 
polyadenylation signal sequences. Another preferred 
host is an insect cell, for example the Drosophila 
larvae. Using insect cells as hosts, the Drosophila 
alcohol dehydrogenase promoter can be used. Rubin, 
ScifiZlSfi 240:1453-1459(1988). Alternatively, baculovirus 
vectors can be engineered to express large amounts of 
ADR-l and/or AUR-2 in insects cells (Jasny, Sci^no^ 
238:1653 (1987); Miller et al . , In: Genetic Engineering 
(1986), setlow, J.K., et al., eds.. Plenum, Vol. 8, pp. 
277-297) . 
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Any of a series of yeast gene sequence 
expression systems can be utilized which incorporate 
promoter and termination elements from the actively 
expressed gene sequences coding for glycolytic enzymes 
are produced in large quantities when yeast are grown in 
5 mediums rich in glucose. Known glycolytic gene sequences 
can also provide very efficient transcriptional control 
signals. Yeast provides substantial advantages in that 
it can also carry out post-translational peptide 
modifications. A number of recombinant DNA strategies 
10 exist which utilize strong promoter sequences and high 
copy number of plasmids which can be utilized for 
production of the desired proteins in yeast. Yeast 
recognizes leader sequences on cloned mammalian gene 
sequence products and secretes peptides bearing leader 
15 sequences {i.e., pre -pept ides ) . For a mammalian host, 
several possible vector systems are available for the 
expression of AXJR-1 and/ or AUR-2. 

A wide variety of transcriptional and 
translational regulatory sequences may be employed, 
20 depending upon the nature of the host. The 

transcriptional and translational regulatory signals may 
be derived from viral sources, such as adenovirus, 
bovine papilloma virus, cytomegalovirus, simian virus, 
or the like, where the regulatory signals are associated 
25 with a particular gene sequence which has a high level 
of expression. Alternatively, promoters from mammalian 
expression products, such as actin, collagen, myosin, 
and the like, may be employed. Transcriptional 
initiation regulatory signals may be selected which 
allow for repression or activation, so that expression 
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of the gene sequences can be modulated. Of interest are 
regulatory signals which are temperature-sensitive so 
that by varying the temperature, expression can be 
repressed or initiated, or are subject to chemical (such 
as metabolite) regulation. 

Expression of AUR-1 and/or AUR-2 in eukaryotic 
hosts requires the use of eukaryotic regulatory regions. 
Such regions will, in general, include a promoter region 
sufficient to direct the initiation of RNA synthesis. 
Preferred eukaryotic promoters include, for example, the 
promoter of the mouse metal lothionein I gene sequence 
(Hamer et al., J, Mol . Anpl . n^^n 1:273-288(1982)); the 
TK promoter of Herpes virus (McKnight, Cell 31:355-365 
(1982)); the SV40 early promoter (Benoist et al., Natur^^ 
(London) 290:304-310(1981)); the yeast gal4 gene 
sequence promoter (Johnston et al . , Proc. Natl, ^r^ tfl 
Sci. (USA) 79:6971-6975(1982); Silver et al . , Proc, 

Nfltl. Agad, ggir. (USA) 51:5951-5955 (1984)). 

Translation of eukaryotic mRNA is initiated at 
the codon which encodes the first methionine. For this 
reason, it is preferable to ensure that the linkage 
between a eukaryotic promoter and a DNA sequence which 
encodes AUR-1 and/or ADR-2 (or a functional derivative 
thereof) does not contain any intervening codons which 
are capable of encoding a methionine (i.e., AUG). The 
presence of such codons results either in a formation of 
a fusion protein (if the AUG codon is in the same 
reading frame as the AUR-1 and/or AUR-2 coding sequence) 
or a frame-shift mutation (if the AUG codon is not in 
the same reading frame as the AUR-l and/or AUR-2 coding 
sequence) . 
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A AUR-1 and/or AUR-2 nucleic acid molecule and 
an operably linked promoter may be introduced into a 
recipient prokaryotic or eukaryotic cell either as a 
nonreplicating DNA (or RNA) molecule, which may either 
be a linear molecule or, more preferably, a closed 

5 covalent circular molecule. Since such molecules are 
incapable of autonomous replication, the expression of 
the gene may occur through the transient expression of 
the introduced sequence. Alternatively, permanent 
expression may occur through the integration of the 

10 introduced DNA sequence into the host chromosome, 

A vector may be employed which is capable of 
integrating the desired gene sequences into the host 
cell chromosome. Cells which have stably integrated the 
introduced DNA into their chromosomes can be selected by 

15 also introducing one or more markers which allow for 
selection of host cells which contain the expression 
vector. The marker may provide for prototrophy to an 
auxotrophic host, biocide resistance, e.g., antibiotics, 
or heavy metals, such as copper, or the like. The 

20 selectable marker gene sequence can either be directly 
linked to the DNA gene sequences to be expressed, or 
introduced into the same cell by co-transf ection. 
Additional elements may also be needed for optimal 
synthesis of single chain binding protein mRNA. These 

25 elements may include splice signals, as well as 

transcription promoters, enhancers, and termination 
signals- cDNA expression vectors incorporating such 
elements include those described by Okayama, Molec r 
r^M . Biol. 3:280(1983). 
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The introduced nucleic acid molecule can be 
incorporated into a plasmid or viral vector capable of 
autonomous replication in the recipient host. Any of a 
wide variety of vectors may be employed for this 
purpose. Factors of inportance in selecting a 
particular plasmid or viral vector include: the ease 
with which recipient cells that contain the vector may 
be recognized and selected from those recipient cells 
which do not contain the vector; the number of copies of 
the vector which are desired in a particular host; and 
whether it is desirable to be aible to "shuttle" the 
vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids 
such as those capable of replication in E. coil (such 
as, for example, pBR322, ColEl, pSClOl, pACYC 184, nVX. 
Such plasmids are, for example, disclosed by Sambrook 
(cf . "Molecular Cloning: A Laboratory Manual", second 
edition, edited by Sambrook, Fritsch, & Maniatis, Cold 
Spring Harbor Laboratory, (1989)). Bacillus plasmids 
include pC194, pC221, pT127, and the like. Such plasmids 
are disclosed by Gryczan (In: The Molecular Biology of 
the Bacilli, Academic Press, NY (1982), pp. 307-329). 
Suitable Streptomyces plasmids include plJloi (Kendall 

al., J , Bacteriol , I^9:4177-4183 (1987)), and 
streptomyces bacteriophages such as 0C31 (Chater et al.. 
In: Sixth International Symposium on Actinomycetales 
Biology, Akademiai Kaido, Budapest, Hungary (1986), pp. 
45-54) . Pseudomonas plasmids are reviewed by John et al. 
(R^V. Infect, Pis. a:693-704 (1986) ) , and Izaki ( Jon. .T 
Bacte-r-inl 33:729-742(1978)). 
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Preferred eukaryotic plasmids include, for 
example, BPV, vaccinia, SV40, 2 -micron circle, and the 
like, or their derivatives- Such plasmids are well known 
in the art (Botstein et al., Miami Wntr. Svmp, 19:265- 
274(1982); Broach, In: "The Molecular Biology of the 
Yeast Saccharomyces: Life Cycle and Inheritance", Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY, p. 
445-470 (1981); Broach, Cell 25:203-204 (1982); Bollon 
et at., .T r^^n■ HRmatol ■ Oncol. 20:39-48 (1980); 
Maniatis, In: Cell Biology: A Comprehensive Treatise, 
Vol. 3, Gene Sequence Expression, Academic Press, NY, 
pp. 563-608 (1980) . 

Once the vector or nucleic acid molecule 
containing the construct (s) has been prepared for 
expression, the DNA construct (s) may be introduced into 
an appropriate host cell by any of a variety of suitable 
means, i,e., transformation, transf ection, conjugation, 
protoplast fusion, electroporation, particle gun 
technology, calcium phosphate -precipitation, direct 
microinjection, and the like. After the introduction of 
the vector, recipient cells are grown in a selective 
medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene 
molecule (s) results in the production of AUR-1 and/or 
AUR-2 or fragments thereof. This can take place in the 
transformed cells as such, or following the induction of 
these cells to differentiate (for example, by 
administration of bromodeoxyuracil to neuroblastoma 
cells or the like) . A variety of incubation conditions 
can be used to form the peptide of the present 
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invention. The most preferred conditions are those which 
mimic physiological conditions. 

V. Purified AUR-1 and/or AUR-2 Polypeptides 

A variety of methodologies known in the art 
can be utilized to obtain the peptide of the present 
invention. The peptide may be purified from tissues or 
cells which naturally produce the peptide. 
Alternatively, the above-described isolated nucleic acid 
fragments could be used to expressed the AUR-1 and/or 
AUR-2 protein in any organism. The samples of the 
present invention include cells, protein extracts or 
membrane extracts of cells, or biological fluids. The 
sample will vary based on the assay format, the 
detection method and the nature of the tissues, cells or 
extracts used as the san^le. 

Any eukaryotic organism can be used as a 
source for the peptide of the invention, as long as the 
source organism naturally contains such a peptide. As 
used herein, "source organism" refers to the original 
organism from which the amino acid sequence of the 
subunit is derived, regardless of the organism the 
subunit is expressed in and ultimately isolated from. 

One skilled in the art can readily follow 
known methods for isolating proteins in order to obtain 
the peptide free of natural contaminants. These 
include, but are not limited to: size-exclusion 
chromatography, HPLC, ion- exchange chromatography, and 
immuno- affinity chromatography. 
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VI. An Antibody Having Binding Affinity To A AUR-1 
and/or AUR-2 Polypeptide And A Hybrldoma Containing the 
Antibody. 

The present invention relates to an antibody 
having binding affinity to a AUR-1 and/or AUR-2 
5 polypeptide. The polypeptide may have the amino acid 
sequence set forth in SEQ ID NO: 3 or SEQ ID NO:4, or 
functional derivative thereof, or at least 9 contiguous 
amino acids thereof (preferably, at least 20, 30, 35, or 
4 0 contiguous amino acids thereof) . 
10 The present invention also relates to an 

antibody having specific binding affinity to an AUR-1 
and/or AUR-2 polypeptide. Such an antibody may be 
isolated by comparing its binding affinity to a AUR-1 
and/or AUR-2 polypeptide with its binding affinity to 
15 another polypeptide- Those which bind selectively to 
AUR-l and/or AUR-2 would be chosen for use in methods 
requiring a distinction between AUR-1 and/or AUR-2 and 
other polypeptides. Such methods could include, but 
should not be limited to, the analysis of altered AUR-1 
20 and/or AUR-2 expression in tissue containing other 
polypeptides. 

The AUR-1 and/or AUR-2 proteins of the present 
invention can be used in a variety of procedures and 
methods, such as for the generation of antibodies, for 
25 use in identifying pharmaceutical compositions, and for 
studying DNA/protein interaction. 

The AUR-1 and/or AUR-2 peptide of the present 
invention can be used to produce antibodies or 
hybridomas. One skilled in the art will recognize that 
30 if an antibody is desired, such a peptide would be 
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generated as described herein and used as an imraunogen. 
The antibodies of the present invention include 
monoclonal and polyclonal antibodies, as well fragments 
of these antibodies, and humanized forms. Humanized 
forms of the antibodies of the present invention may be 
generated using one of the procedures known in the art 
such as chimerization or CDR grafting. The present 
invention also relates to a hybridoma which produces the 
above-described monoclonal antibody, or binding fragment 
thereof. A hybridoma is an immortalized cell line which 
is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing 
monoclonal antibodies and hybridomas are well known in 
the art (Campbell, "Monoclonal Antibody Technology: 
liaboratory Techniques in Biochemistry and Molecular 
Biology," Elsevier Science Publishers, Amsterdam, The 
Netherlands (1984); St. Groth et al , , J. Imtntinn] . 
Methods 35:1-21(1980)). Any animal (mouse, rabbit, and 
the like) which is known to produce antibodies can be 
immunized with the selected polypeptide. Methods for 
immunization are well known in the art. Such methods 
include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that 
the amount of polypeptide used for immunization will 
vary based on the animal which is immunized, the 
antigenicity of the polypeptide and the site of 
injection. 

The polypeptide may be modified or 
administered in an adjuvant in order to increase the 
peptide antigenicity. Methods of increasing the 
antigenicity of a polypeptide are well known in the art. 
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Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or (3- 
galactosidase) or through the inclusion of an adjuvant 
during immuni zat ion . 

For monoclonal antibodies, spleen cells from 

5 the immunized animals are removed, fused with myeloma 

cells, such as SP2/0-Agl4 myeloma cells, and allowed to 
become monoclonal antibody producing hybridoma cells. 
Any one of a number of methods well known in the art can 
be used to identify the hybridoma cell which produces an 

0 antibody with the desired characteristics. These include 
screening the hybridomas with an ELISA assay, western 
blot analysis, or radioimmunoassay (Lutz et al., Eso^ 
rp>ll Res. 275:109-124(1988)). Hybridomas secreting the 
desired antibodies are cloned and the class and subclass 

.5 is determined using procedures Icnown in the art 

(Campbell, "Monoclonal Antibody Technology: Laboratory 
Techniques in Biochemistry and Molecular Biology", supra 
(1984) ) . 

For polyclonal antibodies, antibody containing 
20 antisera is isolated from the immunized animal and is 

screened for the presence of antibodies with the desired 
specificity using one of the above -described procedures. 
The above -described antibodies may be detect ably 
labeled. Antibodies can be detectably labeled through 
25 the use of radioisotopes, affinity labels (such as 

biotin, avidin, and the like) , enzymatic labels (such as 
horse radish peroxidase, alkaline phosphatase, and the 
like) fluorescent labels (such as FITC or rhodamine, and 
the like), paramagnetic atoms, and the like. Procedures 
30 for accomplishing such labeling are well-known in the 
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art, for example, see (Stemberger et al., J. Hi g^r^^^ f >^ n 
Cytgghem. ie:3i5 (1970) ; Bayer et at., Meth, v.rf^ ,yin 
62:308 (1979); Engval et al., IimnuTioh 105:129(1972); 
Go<iing/ J. Immunol. M^^h 23:215(1976)). The labeled 
antibodies of the present invention can be used for in 
vitro, in vivo, and in situ assays to identify cells or 
tissues which express a specific peptide. 

The above -described antibodies may also be 
immobilized on a solid support. Examples of such solid 
supports include plastics such as polycarbonate, complex 
carbohydrates such as agarose and sepharose, acrylic 
resins and such as polyacrylamide and latex beads. 
Techniques for coupling antibodies to such solid 
supports are well known in the art (Weir et al . , 
"Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Pxibli cat ions, Oxford, England, Chapter 
10(1986); Jacoby et al., Meth . r.r^^ym 34^ Academic 
Press, N.Y. (1974)). The immobilized antibodies of the 
present invention can be used for in vitro, in vivo, and 
in situ assays as well as in immunochromotography. 

Furthermore, one skilled in the art can 
readily adapt currently available procedures, as well as 
the techniques, methods and kits disclosed above with 
regard to antibodies, to generate peptides capable of 
binding to a specific peptide sequence in order to 
generate rationally designed antipeptide peptides, for 
example see Hurby et al., "Application of Synthetic 
Peptides: Antisense Peptides", In Synthetic Peptides, A 
User's Guide, W.H. Freeman, NY, pp. 289-307(1992), and 
Kaspczak et al., Biochemistry 28:9230-8(1989). 
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Ant i -peptide peptides can be generated by 
replacing the basic amino acid residues found in the 
ADR-1 and/or ADR- 2 peptide sequence with acidic 
residues, while maintaining hydrophobic and uncharged 
polar groups. For example, lysine, arginine, and/or 
5 histidine residues are replaced with aspartic acid or 

glutamic acid and glutamic acid residues are replaced by 
lysine, arginine or histidine - 

VII. An Antibody Based Method And Kit For Detecting 

10 AOR-1 and/or AUR-2. 

The present invention encompasses a method of 
detecting an AUR-1 and/or AUR-2 polypeptide in a sample, 
comprising: (a) contacting the sample with an above- 
described antibody, under conditions such that 
15 immunocomplexes form, and (b) detecting the presence of 
said antibody bound to the polypeptide. In detail, the 
methods comprise incubating a test sample with one or 
more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. 
20 Altered levels of AUR-1 and/or ADR-2 in a sample as 
compared to normal levels may indicate disease. 

Conditions for incubating an antibody with a 
test sample vary. Inc\ibation conditions depend on the 
format employed in the assay, the detection methods 
25 employed, and the type and nature of the antibody used 

in the assay. One skilled in the art will recognize that 
any one of the commonly available immunological assay 
formats (such as radioimmunoassays, enzyme-linked 
immunosorbent assays, diffusion based Ouchterlony, or 
30 rocket immunofluorescent assays) can readily be adapted 
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to employ the antibodies of the present invention. 
Examples of such assays can be found in Chard, "i\n 
Introduction to Radioimmunoassay and Related Techniques" 
Elsevier Science Publishers, Amsterdam, The Netherlands 
(1986); Bullock et al., "Techniques in 

Immunocytochemistry, " Academic Press, Orlando, FL Vol. 
1(1982), Vol, 2 (1983), Vol, 3 (1985); Tijssen, 
"Practice and Theory of Enzyme Immunoassays: Laboratory 
Techniques in Biochemistry and Molecular Biology, " 
Elsevier Science Publishers, Amsterdam, The Netherlands 
(1985) . 

The immunological assay test samples of the 
present invention include cells, protein or membrane 
extracts of cells, or biological fluids such as blood, 
serum, plasma, or urine. The test sample used in the 
above -described method will vary based on the assay 
format, nature of the detection method and the tissues, 
cells or extracts used as the sample to be assayed. 
Methods for preparing protein extracts or membrane 
extracts of cells are well known in the art and can be 
readily be adapted in order to obtain a sample which is 
capable with the system utilized. 

A kit contains all the necessary reagents to 
carry out the previously described methods of detection. 
The kit may comprise: (i) a first container means 
containing an above -described antibody, and (ii) second 
container means containing a conjugate comprising a 
binding partner of the antibody and a label. In another 
preferred embodiment, the kit further comprises one or 
more other containers comprising one or more of the 
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following: wash reagents and reagents capable of 
detecting the presence of bound stntibodies. 

Examples of detection reagents include, but 
are not limited to, labeled secondary antibodies, or in 
the alternative, if the primary antibody is labeled, the 
5 chromophoric, enzymatic, or antibody binding reagents 

which are capable of reacting with the labeled antibody. 
The compartmentalized kit may be as described above for 
nucleic acid probe kits. One skilled in the art will 
readily recognize that the antibodies described in the 
10 present invention can readily be incoirporated into one 
of the established kit formats which are well known in 
the art. 

VXII. Isolation of Compounds Which Interact With AUR-1 
15 and/or ADR- 2. 

The present invention also relates to a method 
of detecting a compound capable of binding to a AUR-l 
and/ or AUR-2 polypeptide comprising incubating the 
compound with AUR-l and/or AUR-2 and detecting the 

20 presence of the compound bound to AUR-l and/ or AUR-2, 
The compound may be present within a complex mixture, 
for example, serum, body fluid, or cell extracts. 

The present invention also relates to a method 
of detecting an agonist or antagonist of AUR-l and/or 

25 AUR-2 activity or AUR-l and/or AUR-2 binding partner 

activity comprising incubating cells that produce AUR-l 
Euid/or AUR-2 in the presence of a compound and detecting 
changes in the level of AUR-l and/or AUR-2 activity or 
AUR-l and/or AUR-2 binding partner activity. The 

30 compounds thus identified would produce a change in 
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activity indicative of the presence of the compound. 
The compound may be present within a conplex mixture, 
for example, serum, body fluid, or cell extracts. Once 
the con^jound is identified it can be isolated using 
techniques well known in the art. 

The present invention also encompasses a 
method of agonizing (stimulating) or antagonizing AUR-i 
and/or AUR-2 associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to 
AUR-l and/or AUR-2 in an amount sufficient to effect 
said agonism or antagonism. A method of treating 
diseases in a mammal with an agonist or antagonist of 
ADR-1 and/or AUR-2 related activity comprising 
administering the agonist or antagonist to a mammal in 
an amount sufficient to agonize or antagonize AUR-i 
and/or AtJR-2 associated functions is also encompassed in 
the present application. 



ZX. Tremsgenic Animals. 

A variety of methods are available for the 
production of transgenic animals associated with this 
invention. DNA can be injected into the pronucleus of a 
fertilized egg before fusion of the male and female 
pronuclei, or injected into the nucleus of an embryonic 
cell the nucleus of a two-cell embryo) following 

the initiation of cell division {Brinster £t ai^, proc. 
Wflt . ACf^d. Sqx. USft 82: 4438-4442 (1985)). Embryos can 
be infected with viruses, especially retroviruses, 
modified to carry inorganic- ion receptor nucleotide 
seq[uences of the invention. 
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Pluripotent stem cells derived from the inner 
cell mass of the embryo and stabilized in culture can be 
manipulated in culture to incorporate nucleotide 
sequences of the invention. A transgenic animal can be 
produced from such cells through implantation into a 
5 blastocyst that is implanted into a foster mother and 
allowed to come to term. TUiimals suitable for 
trcuisgenic experiments can be obtained from standard 
commercial sources such as Charles River (Wilmington, 
MA) , Taconic (Germantovm, NY) , Harlan Sprague Dawley 
10 (Indianapolis, IN), etc. 

The procedures for manipulation of the rodent 
embryo and for microinjection of DNA into the pronucleus 
of the zygote are well known to those of ordinary skill 
in the art (Hogan al- . .giifiXS.) . Microinjection 
15 procedures for fish, amphibian eggs and birds are 

detailed in Houdebine and Chourrout, Bxpgri^nt j-^ 47: 
897-905 (1991) • Other procedures for introduction of 
DNA into tissues of animals are described in U.S. Patent 
No., 4,945,050 (Sandford fit aI- r July 30, 1990). 
20 By way of example only, to prepare a 

transgenic mouse, female mice are induced to 
superovulate . Females are placed with males, and the 
mated females are sacrificed by CO2 asphyxiation or 
cervical dislocation and embryos are recovered from 
25 excised oviducts- Surrounding cumulus cells are 

removed. Pronuclear embryos are then washed and stored 
until the time of injection. Randomly cycling adult 
female mice are paired with vasectomized males. 
Recipient females are mated at the same time as donor 
3 0 females. Embryos then are transferred surgically. The 
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procedure for generating transgenic rats is similar to 
that of mice. See Hammer £t al- . Cell 63:1099-1112 
(1990) . 

Methods for the culturing of embryonic stem 
(ES) cells and the subsequent production of transgenic 
animals by the introduction of DNA into ES cells using 
methods such as electroporation, calcium phosphate/DNA 
precipitation and direct injection also are well known 
to those of ordinary skill in the art. See, for 
example, TeratPcarrinomas and EmhT-yon^p fif^i" '^^^^r r\ 
PractAqal Approflrh , E.J. Robertson, ed., IRL Press 
(1987) . 

In cases involving random gene integration, a 
clone containing the sequence (s) of the invention is co- 
transfected with a gene encoding resistance. 
Alternatively, the gene encoding neomycin resistance is 
physically linked to the sequence (s) of the invention. 
Transfection and isolation of desired clones are carried 
out by any one of several methods well known to those of 
ordinary skill in the art (E.J. Robertson, aucra) . 

DNA molecules introduced into ES cells can 
also be integrated into the chromosome through the 
process of homologous recombination. Capecchi, Qsisnss. 
244: 1288-1292 (1989). Methods for positive selection 
of the recombination event (j^, neo resistance) and 
dual positive-negative selection (i^, neo resistance 
and gancyclovir resistance) and the subsequent 
identification of the desired clones by PGR have been 
described by Capecchi, sUEEa and Joyner £t al. , HatUEe 
338: 153-156 (1989), the teachings of which are 
incorporated herein. The final phase of the procedure 
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is to inject targeted ES cells into blastocysts and to 
transfer the blastocysts into pseudopregnant females. 
The resulting chimeric animals are bred and the 
offspring are analyzed by Southern blotting to identify 
individuals that carry the transgene. Procedures for 
the production of non-rodent mammals and other animals 
have been discussed by others. See Houdebine and 
Chourrout. sufita; Pursel sJL al- . Sci^UQ^ 244:1281-1288 
(1989); and Sirams s£. al- , Pi o/TfiffhnQloqy 5:179-183 
(1988) . 

Thus, the invention provides transgenic, 
nonhuman mammals containing a transgene encoding a AUR-1 
and/or AUR-2 polypeptide or a gene effecting the 
expression of a ADR-1 and/or AUR-2 polypeptide. Such 
transgenic nonhuman mammals are particularly useful as 
15 an in vivo test system for studying the effects of 

introducing a AUR-l and/or AUR-2 polypeptide, regulating 
the expression of a AUR-1 and/or AUR-2 polypeptide 
H e. . through the introduction of additional genes, 
antisense nucleic acids, or ribozymes) . 
2Q A "transgenic animal" is an animal having 

cells that contain DNA which has been artificially 
inserted into a cell, which DNA becomes part of the 
genome of the animal which develops from that cell. 
Preferred transgenic animals are primates, mice, rats, 
25 cows, pigs, horses, goats, sheep, dogs and cats. The 

transgenic DNA may encode for a human AUR-l and/or AUR-2 
polypeptide. Native expression in an animal may be 
reduced by providing an amount of ant i- sense RNA or DNA 
effective to reduce expression of the receptor. 

30 
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X. Gene Therapy 

AUR-1 and/or AUR-2 or its genetic sequences 
will also be useful in gene therapy (reviewed in Miller, 
Nature 357:455-460, (1992). Miller states that advances 
have resulted in practical approaches to human gene 
therapy that have demonstrated positive initial results. 
The basic science of gene therapy is described in 
Mulligan, Science 260:926-931, (1993). 

In one preferred embodiment, an expression 
vector containing the AUR-1 and/or AUR-2 coding sequence 
is inserted into cells, the cells are grown in vitro and 
then infused in large numbers into patients. In another 
preferred embodiment, a DNA segment containing a 
promoter of choice (for example a strong promoter) is 
transferred into cells containing an endogenous ADR-l 
and/or AUR-2 in such a manner that the promoter segment 
enhances expression of the endogenous AUR-l and/or AUR-2 
gene (for example, the promoter segment is transferred 
to the cell such that it becomes directly linked to the 
endogenous AUR-l and/or AUR-2 gene) . 

The gene therapy may involve the use of an 
adenovirus containing AUR-l and/or AUR-2 cDNA targeted 
to a tumor, systemic AUR-l and/or AUR-2 increase by 
implantation of engineered cells, injection with AUR-l 
and/or AUR-2 virus, or injection of naked AUR-l and/or 
AUR-2 DNA into appropriate tissues. 

Target cell populations may be modified by 
introducing altered forms of one or more components of 
the protein complexes in order to modulate the activity 
of such complexes. For example, by reducing or inhibit- 
ing a complex component activity within target cells, an 
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abnormal signal transduction event (s) leading to a 
condition may be decreased, inhibited, or reversed - 
Deletion or missense mutants of a component, that retain 
the ability to interact with other components of the 
protein complexes but cannot function in signal trans - 
5 duction may be used to inhibit an abnormal, deleterious 
signal transduction event. 

Expression vectors derived from viruses such 
as retroviruses, vaccinia virus, adenovirus, adeno- 
associated virus, herpes viruses, several RNA viruses, 
10 or bovine papilloma virus, may be used for delivery of 
nucleotide sequences f e.g. > cDNA) encoding recombinant 
AUR-1 and/or AUR-2 protein into the targeted cell 
population ^ e.g. . tumor cells) . Methods which are well 
known to those skilled in the art can be used to 
15 construct recombinant viral vectors containing coding 
sequences. See , for example, the techniques described 

in Maniatis et al . , Molecular Cloning: A LabPratPry 

Manual , Cold Spring Harbor Laboratory, N.Y. (1989) , and 
in Ausubel et al-, riiyyent Protocols in Molecular 
20 Biology , Greene Publishing Associates and Wiley 

Interscience, N.Y. (1989)- Alternatively, recombinant 
nucleic acid molecules encoding protein sequences can be 
used as naked DNA or in reconstituted system ^ ,g, , lipo- 
somes or other lipid systems for delivery to target 
25 cells r see e.g. . Feigner et al.. Nature 337:387-8, 

1989) • Several other methods for the direct transfer of 
plasmid DNA into cells exist for use in human gene 
therapy and involve targeting the DNA to receptors on 
cells by complexing the plasmid DNA to proteins. Se£, 
3 0 Miller, supra. 
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In its simplest form, gene transfer can be 
performed by simply injecting minute amounts of DNA into 
the nucleus of a cell, through a process of 
microinjection. Capecchi MR, Cell 22:479-88 (1980). 
Once recombinant genes are introduced into a cell, they 
5 can be recognized by the cells normal mechanisms for 

transcription and translation, and a gene product will 
be expressed. Other methods have also been attempted 
for introducing DNA into larger numbers of cells. These 
methods include: transf ection, wherein DNA is 
0 precipitated with CaPO^ and taken into cells by 

pinocytosis (Chen C. and Okayama H, Mol. Cell Biol. 
7:2745-52 (1987)); electroporation, wherein cells are 
exposed to large voltage pulses to introduce holes into 
the membrane (Chu G. et al.. Nucleic Acids Res., 
> 15:1311-26 (1987)); lipof ection/liposome fusion, wherein 
DNA is packaged into lipophilic vesicles which fuse with 
a target cell (Feigner PL., et al., Proc. Natl. Acad. 
Sci. USA. 84:7413-7 (1987)); and particle bombardment 
using DNA bound to small projectiles (Yang NS. et al., 
Proc. Natl. Acad. Sci. 87:9568-72 (1990)). Another 
method for introducing DNA into cells is to couple the 
DNA to chemically modified proteins. 

It has also been shown that adenovirus 
proteins are capable of destabilizing endosomes and 
enhancing the uptake of DNA into cells. The admixture 
of adenovirus to solutions containing DNA complexes, or 
the binding of DNA to polylysine covalently attached to 
adenovirus using protein crosslinking agents 
substantially improves the uptake and expression of the 
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recombinant gene. Curiel DT et al., Am, J. Respir. 
Cell. Mol. Biol., 6:247-52 (1992), 

As used herein "gene transfer" means the 
process of introducing a foreign nucleic acid molecule 
into a cell. Gene transfer is commonly performed to 
enable the expression of a particular product encoded by 
the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically 
active RNA. Gene transfer can be performed in cultured 
cells or by direct administration into animals. 
Generally gene transfer involves the process of nucleic 
acid contact with a target cell by non-specific or 
receptor mediated interactions, uptake of nucleic acid 
into the cell through the membrane or by endocytosis, 
and release of nucleic acid into the cytoplasm from the 
plasma membrane or endosome. Expression may require, in 
addition, movement of the nucleic acid into the nucleus 
of the cell and binding to appropriate nuclear factors 
for transcription. 

As used herein "gene therapy" is a form of 
gene transfer and is included within the definition of 
gene transfer as used herein and specifically refers to 
gene transfer to express a therapeutic product from a 
cell in vivo or In vitro. Gene transfer can be 
performed ex vivo on cells which are then transplanted 
into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid- 
protein complex into the patient. 

In another preferred embodiment, a vector 
having nucleic acid sequences encoding a AUR-1 and/or 
AUR-2 is provided in which the nucleic acid sequence is 
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expressed only in specific tissue. Methods of achieving 
tissue -specific gene expression as set forth in 
International Publication No. wo 93/09236, filed 
November 3, 1992 and published May 13, 1993. 

In all of the preceding vectors set forth 
above, a further aspect of the invention is that the 
nucleic acid sequence contained in the vector may 
include additions, deletions or modifications to some or 
all of the sequence of the nucleic acid, as defined 
above . 

In another preferred embodiment, a method of 
gene replacement is set forth. "Gene replacement- as 
used herein means supplying a nucleic acid sequence 
which is capable of being expressed in vivo in an animal 
and thereby providing or augmenting the function of an 
endogenous gene which is missing or defective in the 
eUiimal . 



The examples below are non- limiting and are 
merely representative of various aspects and features of 
the present invention. The examples below demonstrate 
the isolation, and characterization of the novel 
proteins AUR-i and AUR-2. 

Protein kinases are one of the largest 
families of eukaryotic proteins with several hundred 
known members. These proteins share a 250-300 amino 
acid domain that can be subdivided into 12 distinct 
subdomains that comprise the common catalytic core 
structure. These conserved protein motifs have recently 
been exploited using PCR-based cloning strategies 
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leading to a significant expansion of the known kinases . 
Multiple alignment of the sequences in the catalytic 
domain of protein kinases and subsequent phylogenetic 
analysis permits their segregation into a phylogenetic 
tree. In this manner, related kinases are clustered 
into distinct branches or subfamilies including: 
tyrosine kinases, cyclic-nucleotide-dependent kinases, 
calcium/calmodulin kinases, cycl in- dependent kinases and 
MAP-kinases, as well as several other less defined 

subfamilies . 

Initially we set out to identify homologues of 

CCK4, a receptor that represents a distinct family of 
tyroline kinases. Multiple alignments suggested CCK4 
was most closely related to ROS and the TRK-family of 
receptor tyrosine kinases. We designed degenerate 
primers to conserved sequences within kinase subdomains 
I and IX of these receptors. Subdomain I is at the 
N-terminus of the kinase domain and contains the 
consensus motif GXGXXGXV which is involved in anchoring 
ATP to the catalytic unit of all classes of kinases, 
subdomain IX contains a nearly invariant Asp which acts 
to stabilize the catalytic loop by bonding to residues 
in subdomain VIB. This invariant Asp and flanking amino 
acids are often used in PCR-cloning strategies as it 
distinguishes tyrosine-kinases (DVWSY/FGI/V) from 
serine/threonine kinases (DXWA/SXGI/V) . Based on 
comparison of all known protein kinases, we designed 
degenerate oligonucleotide primers to subdomains I and 
IX that would pick up only CCK4 and its chicken 
homologue KLG by PGR. 
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10 



25 



We designed degenerate primers A and DVW based 
on conserved residues within the kinase domain of CCK4, 
to use for identification of novel kinases using 
polymerase chain reaction (PCR) . when applied to HEPm 
cell sscDNA as a template, multiple copies of CCK4 v.ere 
isolated as well as a novel DNA fragment (43-43) of 567 
bp with homology to other kinases. The novel sequence 
was most similar to Drosophila aurora kinase (GeneBank 
Accession #X834 65) and the clone was designated human 
Auroral. Using this fragment as a probe, we screened 
RNAs from a number of colon cancer cell lines and a 
human multiple tissue Northern blot, demonstrating an 
apparent selectivity in expression of Auroral in tumor 



cells. 



The Auroral probe was also used to screen a 
cDNA library constructed from human pancreatic cancer 
cell line mRNA to isolate overlapping clones spanning 
the complete open reading frame of Auroral. Of multiple 
clones isolated, seven corresponded to human Auroral . 
Two additional faintly hybridizing clones were also 
isolated during this screen and sequence analysis 
revealed they corresponded to a related, yet distinct 
kinase, which we designated human Aurora2. 

Recombinant ADRORAl and AURORA2 expressed in 
COS cells migrated with apparent Mr of 39,000 and 
46,000, consistent with their predicted molecular 
weights of 39264 and 46730 based on their primary amino 
acid sequence. This analysis confirms the recombinant 
protein can be stably produced in mammalian cells. 
Phosphorylation assays to determine target specificity 
30 of this putative kinases are ongoing. 



BNSOOOID: <WOL_87287oaA1J_> 



PCT/US96/I8859 

WO 97/22702 



54 

Specific immunoreagents were generated in 
rabbits against peptide sequence from the N-tertninal 
domains of AURl and AI3R2 have these reagents to localize 
expression of endogenous and recombinant Auroras within 
the cells. Furthermore, these reagents can be used in 
5 an effort to identify substrates for the AURORAS in 

order to better understand their normal biological role. 

Dominant negative and const it utively active 
forms of ADRl and AUR2 will be useful for delineating 
10 biologic consequences of either oblation or 

overexpression of these putative serine /threonine 
kinases- Initial studies with altered DNA constructs 
demonstrates that in just 2 hours following infection of 
NIH3T3 or BALB/3T3 cells with AURl or AUR2 retroviral 
15 stocks, cells become multinucleated. This phenotype 
persisted such that 2 days after infection some cells 
were found to have as many as 20 nuclei . The 
multinucleated cells typically had increased cytoplasm 
and diffuse cell boundaries. Immunostaining with both 
20 actin and DAPI, confirmed these nuclei were all 

contained within a single cell, and that the actin 
cytoskeleton was apparently normal. Ongoing experiments 
will address the chromosome content and intactness 
within the nuclei, the number and location of 
25 centrosomes, and the general organization of the 

microtubule network. 

Characterization of the long term consequences 
of expression of AURl or AUR2 normal, const i tut ively 
active and dominant negative are underway, and 
30 particularly whether they induce or reverse cellular 
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transformation. Stable clones are being isolated that 
express these recombinant proteins. They will be 
characterized for growth rate, DNA synthesis, 
cell-contact inhibition (foci formation), 
anchorage -independent growth (soft agar assays), and 
5 tumorigenicity in nude mice. 

What role do the Auroras play in mammalian 
centrosome replication and segregation? Disruption or 
deregulation of such functions is known to result in 
chromosome missegregation, monopolar spindles, and 
0 asymetric nuclear division in yeast, drosophila, and 

amphibians. The homology between human Auroras and the 
yeast IPLl and drosophila aurora is striking. We 
therefore desire to assess if the human auroras are 
functional equivalents of the yeast IPLl. The yeast iPLl 
> gene is required for high-fidelity chromsome segregation 
in Saccharomyces cerevisiae (Francisco, L., et al . , Mol. 
Cell. Biol. 14:4731-4740, 1994) and a temperature 
sensitive mutant has been isolated. We plan to 
determine if various full length and chimeric versions 
of the human Auroras can complement this temperature 
sensitive mutant in yeast. The chimeric constructs 
containing the N- terminal domain of IPLl and the kinase 
domain of the human auroras will permit us to determine 
if the kinase is functionally equivalent, while avoiding 
potential regulatory or intracellular localization roles 
mediated by the less well conserved N- terminal domains. 
Should the human genes compensate for the IPLl loss of 
function, we could then use such a line to screen for 
small molecule inhibitors of the human aurora kinase. 
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piyy^ ffliT.Ti! li yinT.TCffniAR CLONING 

Total RNAs were isolated using the Guanidine 
Salts/Phenol extraction protocol of Chomczynski and 
Sacchi {P. Chomczynski and N. Sacchi, Anal. Biochera. 
162, 156 (1987) from normal hioman prostate, duodenum, 
5 ovary, liver, pituitary, brain, thymus, and salivary 

gland, from human HEPM cells (palatal mesenchyme) , from 
primary human Wilm's tumor and ovarian carcinoma, and 
from human tumor cell lines originating from 
colon/rectum (HT29. SW480, SW1463, SW1417, SW837, 
10 SW948, SW620, SW403, SW1116, T84, HTC15, LS123, and 

Caco-2), kidney (CaKi-1, CaKi-2) , liver (SK-HEP-1) , 
pancreas {HS766T, ASPC, Capan-1) , and breast (MCF7) . 

These RNAs were used as templates to generate 
single -stranded cDNAs using the Superscript 
15 preamplif ication System for First Strand Synthesis kit 
purchased from GibcoBRL (Life Technologies, U.S.A.; 
Gerard, GF et al . (1989), FOCUS 11, 66) under conditions 
recommended by manufacturer. A typical reaction used 10 
ug total RNA or 2 ug poly (A)* RNA with 1.5 ug 
20 oligo(dT)„.x, in a reaction volume of 60 ul. The product 
was treated with RNaseH and diluted to 100 ul with H2O. 
For subsequent PCR amplification, 1-4 ul of these 
sscDNAs were used in each reaction. 

Oligonucleotides were synthesized on an 
25 Applied Biosystems 394 DNA synthesizer using established 
phosphoramidite chemistry and were used unpurif ied after 
precipitation with ethanol. The degenerate 
oligonucleotide primers are: 

A = 5'-GARTTYGGNGARGTNTTyYTNGC-3' (SEQ ID NO: 16) 
30 (sense) and 
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DVW = 5 • -AGNACNCCRAANGCCCACACRTC-3 ' (SEQ ID NO: 17) 
(antisense) . 

These primers were derived from the peptide 
sequences EFGEVFIA (SEQ ID NO: 18) (sense strand from 
kinase subdoraain I) euid DVW(A/S)PGVL (antisense strand 
from kinase subdomain IX) , respectively. Degenerate 
nucleotide residue designations are: N = A, C, G, or T; 
R = A or G; and Y = C or T. Using CCK4 as a template, 
these primers produce a product of 567 bp. 

A PGR reaction was performed using Primers A 
and DVW applied to the single -stranded sources listed 
above. The primers were added at a final concentration 
of 5 uM each to a mixture containing 10 mM Tris HCl 
(pH8.3), 50 mM KCl, 1 . 5 mM MgCl,, 200 uM each 
deoxynucleoside triphosphate, 0.001% gelatin, and 1.5 u 
15 AmpliTaq DNA Polymerase (Perkin-Elmer/Cetus) , and 1-4 ul 
cDNA. Following 3 min denaturation at 95°C, the cycling 
conditions were 94«C for 30 s, 3 7''C for 1 min, a 2 min 
ramp to 72''C, and 72«>C for Imin for the first 3 cycles, 
followed by 94<>C for 30 s, 50»C for 1 min, and 72»C for 
Imin 45 s for 35 cycles. PGR fragments migrating at 
between 500-600bp were isolated from 2% agaorse gels 
using GeneClean (BiolOl) , and T-A cloned into the pCRIl 
vector (Invitrogen Corp. U.S.A.) according to the 
manufacturer • s protocol . 
25 Colonies were selected for mini plasmid 

DNA-preparations using Qiagen columns and the plasmid 
DNAs were sec[uenced using cycle sequencing 
dye -terminator kit with AmpliTaq DNA Polymerase, FS 
(ABI, Foster City, CA) . Sequencing reaction products 
were run on an ABI Prism 377 DNA Sequencer, and analyzed 



20 



30 
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using the BLAST alignment algorithm (Altschul, S.F. et 
al., J- Mol. Biol. 215:403-10) . A novel clone (#43-43) 
was isolated by PCTR with primers A and DVW on 
single -stranded cDNA from human embryonic palatal 
mesenchyme (HEPM or CRL1486) as a template. This clone 
5 was subsequently designated as a fragment of human 
Auroral . 

A lambda ZapII (Stratagene Cloning Systems, La 
Jolla, CA) cDNA library was constructed using mRNA from 
a pool of pancreatic carcinoma cell lines as a template 

10 for first strand cDNA synthesis. Phage were screened on 
nitrocellulose filters with the random primed "P-labeled 
insert from p43-43 encoding human Auroral at 2x10^ cpm/ml 
in hybridization buffer containing 6xSSC, Ix Denhardt • s 
reagent, 0.1% SDS, with O.lmg/ml denatured, fragmented 

15 salmon sperm DNA. After overnight hybridization at 65«C, 
filters were washed in O.lxSSC, 0, 1%SDS at 65^C. Full 
length cDNA clones were sequenced on both strands using 
manual sequencing with T7 polymerase and oligonucleotide 
primers (Tabor and Richardson, 1987, Proc. Natl. Acad. 

20 Sci. U.S.A. 84: 4767-71). 

^.y7VMPt.E 2 s WQRTHERW BLOT ANALYSIS 

Northern blots containing 2 ug poly A+RNA per 
lane from 16 different adult human tissues {spleen, 

25 thymus, prostate, testis, ovary, small intestine, 

colonic mucosa, heart, brain, placenta, lung, liver, 
skeletal muscle, kidney, pancreas, and peripheral blood 
leukocytes) , four different human fetal tissues (brain, 
lung, liver, and kidney) , and 8 human cancer cell lines 

30 (HL60, HeLa, K-562, MOLT-4, Raji, SW480, and G361) on a 
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charge -modified nylon membrane were obtained from 
Clontech (Palo Alto, CA) . Additional Northern blots 
were prepared by running 10 ug total RNA isolated from 
human tumor cell lines on a denaturing formaldehyde 1.2% 
agarose gel and transferring to nylon membranes. 
5 Filters were hybridized with random prime 

t^^^PldCTP- labeled probes synthesized from either the 527 
bp insert from human Auroral clone 43-43 or the 1162 bp 
EcoRI fragment from pSG20, and either the Ikb EcoRI 
fragment of human Aurora2 clone 11 -lA or the 1257 bp 
10 BamHI-Not I fragment from pS621. Hybridization was 
performed at 60**C overnight in 6XSSC, 0.1% SDS, IX 
Denhardt's solution, lOO mg/ml denatured herring sperm 
DNA with 1-2 X 10^ cpm/ml of ^^p. labeled DNA probes. The 
filters were washed in O.IXSSC/O.1% SDS, 65<»C, and 
15 exposed overnight on Kodak XAR-2 film. 

A single AURl mRNA transcript of approximately 
1.4kb was identified, and was found to be most abundant 
in the thymus and small intestine with weak signals from 
testis, ovary, colon, placenta, and spleen. Prostate 
20 and peripheral blood lymphocytes were negative. Humsm 

fetal liver and kidney were also positive, with a weaker 
signal in fetal lung and no expression in fetal brain 
(Table) 

A similar analysis of human AUR2 expression 
25 showed a more restricted expression profile. A single 
2.4 kb AUR2 transcript was detected strongly in the 
adult testis and thymus, and weakly in [heart, placenta, 
skeletal muscles] and in fetal liver and kidney whereas 
the other normal tissue sources were negative (see 
30 Table) . 



BNSOOCID: 'WO_Sf72270aAl_U> 



wo 97/22702 



PCTAJS9fi/18859 



60 

PAGE INTENTIONALLY LEFT BLANK 



BNSDOCID: <WO_9722702A1_L> 



wo 97/22702 



PCT/US96/18859 



61 



AURORA 1 and AURORA 2 NORTHERN ANALYSIS IN HUMAN NORMAL 

TISSUE AND CANCER CELLS 



Cell type 



Origin 



AUR 1 



AUR 2 



Thymus 
Petal liver 
Fetal kidney 
Lung 

Duodenum 
10 I Colon 

Fetal lung 
Ovary 
Testis 
Brain 
15 I Cerebellum 

Salivary gland 
Heart 
Liver 
Pancreas 
20 I Kidney 
Spleen 
Stomach 
Uterus 
Prostate 
25 I Skeletal muscle 
Fetal brain 
PBLs 

Salivary gland 



Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 
Normal tissue 



5 

4 

4 

3 

2 

2 

2 

2 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



4 

2 

1 

0 

1 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 
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Cell type 




&TTP 1 


ATTP 9 


Placenta 




n 


0 


SF-268 




A 




CCRF - CEM 


T Ai 1 It om *i a 


4 


ND 


K-562 


XjeuKenixa 


A 
H 


MHXJ 


HCC-2998 


coxon uuniOi 


A 




SW620 


coxon uUTnor 


A 
*X 




KM-12 


Leukemxa 


A 
H 


nx/ 


MCF7 / ADR -RES 


xaxeasu uuiuoxr 


A 




MDA-N 




A 
'x 


XX JL/ 


BT-54 9 


Boreast turaoi" 


A 
*± 




SW480 


coxon uuinox 


A 
*X 


4 


SW48 


L-OxOn uuluOx 


A 




Calu-3 


xjVuig uuiuOx 


A 




Calu3 


IjUli^ UUllllOx 


4 




T4 7D 


cxeasu ciiiuox 


A 


2 


A375 


Me X anoma 


A 
*x 


n 


SF767 


uumox 


4 


0 


SW1417 


L.oxon uuniox 


4 


4 


CaKi2 


Kxaney CLiiuor 


A 
*t 


V 


CaKil 


KiQney diinox 


A 
rt 


0 


Caco2 


Colon tutnoir 


4 


4 


SW1417 


Colon tumor 


4 


0 


T98G 


CNS tumor 


4 


0 


SF-539 


CNS tumor 


3 


ND 


SK-MEL-2 


Melanoma 


3 


ND 


SK-MEL-5 


Melanoma 


3 


ND 
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5 



10 



15 



20 



25 



j Cell type 


Origin 


ATTP 1 
aUxv X 


AX7R 2 


j R-48 




3 


ND 


j RF-1 


vja.ouxj.C uUmOJ: 


3 


ND 


SW94e 


Colon t*nfTir^Y- 


3 


ND 


AGS 


Ga-Stl^ir* t"iimr^*- 


3 


ND 


1 HFLl 


NOZmal 1 iinrr 

*v^^<*« ■■IwL^L ' 1 IS] 


3 


ND 


OVCAR-8 


Ova 1^1 ar> t'liinrx'v 


2 


ND 


HT-29 


Colon t"i'!mr%^ 


2 


ND 


MDA-MB-231 




2 


ND 


MDA-MB-435 


BlTGast" t"iiTnor" 


2 


ND 


SK-MEL-5 


snoTfiA 


2 


ND 


Kato-3 


v»«»t.xiC uliniOx 


2 


ND 


Colo 205 


Co 1 OTl ^1t¥W^^ 


2 


ND 


1 Colo 320OM 


wtj-Lon uuiuOjr 


2 


2 


j WiDr 


v-^J-Lon uuiuOx 


2 


ND 


j HT-29 


v-oxon tumor 


2 


ND 


1 SNU-C2B 


Colon t"nmr\>* 


2 


ND 


1 HTC15 


Col on ^iiwirt-v 


2 


2 


j T84 


Col on t"iiTifir\T» 


2 


0 


SW94 8 


Colon tuTnoY" 


2 


0 


j Daoy 


CNS tumor 


2 


0 


1 OVCAR3 


Ovary tumor 


2 


0 


HS766T 


Pancreas tumor 


2 


0 


1 SW1116 


Colon tumor 


2 


0 


1 Wilms tumor 


Kidney tumor 


2 


0 


1 UO-31 


Renal tumor 


0 


ND 
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The AURl mRNA expression profile in several 
primary tumors and multiple cell lines of diverse 
neoplastic origin were determined by Northern analysis 
and by the semi -quantitative PGR assay using primers 
from sequences in the AURl kinase domain. The results 
5 are included in Table. AURl transcripts were detected 
in every tumor line assayed with the highest expression 
in several human colon cancer cell lines (SW480, 
Colo320, SW620, SW1417, Caco2, SW12417) and in lung 
carcinoma (CaluB) , breast carcinoma (T47D, MCF7) . 
10 Melanoma {A375) , Kidney carcinoma (CaKi-1, CaKi-2) , 

liver carcinoma (SK-HEP-1) , and neural tumors (SF767, 
T98G) . Lesser expression of AURl was seen in other 
colon carcinomas (HTC15, T84, SW948, SW1116 , HT29) , 
neural tumors (Daoy) , Ovarian carcinoma (Ovcar3. Primary 
15 tumor) , pancreatic carcinoma (HS766T) , and a primary 

kidney tumor. 

The AUR2 expression profile in tumor cell 
lines was strikingly more restricted than that of AURl. 
Strong expression of AUR2 was detected only in colon 
20 carcinoma cell lines {Caco2, SW480, SW1417, SW620) 

whereas weak signals were seen in other colon (HTC15, 
Colo320), Breast {T47D, MCF7) and lung {Calu3) tumor 
cell lines. Several other tumor lines had no detectable 
AUR2 transcripts. 



25 



fTi^ff^j^ f , ^; fiBMT-Q TTivy rrT'^^'^-rvg DETECTION OF AURQRAX 

RNA was isolated from a variety of human cell 
lines, fresh frozen tissues, and primary tumors. Single 
stranded cDNA was synthesized from 10 mg of each RNA as 
30 described above using the Superscript Preamplif ication 
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System (GibcoBRL) . These single strand templates were 
then used in a 35 cycle PGR reaction with two 
ADRORAl- specific oligonucleotides (3476: 
5 ' -TTTC3GCTCGGGAGAAGAAAAGCCAT-3 • (SEQ ID NO: 19) , and 
3506 : 5 ' -CAATCATCTCTGGGGGCAGGTAGT-3 ■ ) (SEQ ID NO : 20 ) . 
5 Reaction products were electrophoresed on 2% agarose 

gels, stained with ethidium bromide and photographed on 
a DV light box. The relative intensity of the -475 -bp 
ADRORAl-specific bands were estimated for each sample. 

10 ByftWPIiP 4; SOUTHERN BLOT AMAT.vpT.c 

Genomic DNA was isolated from a variety of 
transformed human lines (CaC02, HTC15, LS147T, SKC04, 
SW480. SW403. SW620, SW948, SW1417, SW1116, MCF7, BT474) 
using standard procedures (Maniatis et al.). Cells were 
trypsinized, washed with PBS and resuspended at -lO' 
cells/ml in Digestion buffer (lOOmM NaCl, lOmM Tris pH8. 
25 mM EDTA, pHB, 0.5%SDS, O.i mg/ml proteinase K) . Cells 
were lysed by incubation at 50°C for 12 hours, followed 
by extraction with phenol /chloroform and precipitated 
with and equal volume of 7.5M ammonium acetate and 100% 
EtOH. DNAs were resuspended in TE buffer. 
Approximately 20 micrograms genomic DNA was digested 
with Hindlll or Xholl at 37«»C for at least 4 hours before 
fractionation on 1% agarose gels. The DNA fragments were 
transferred to nitrocellulose membranes by the capillary 
transfer method (Southern, EM, J moI Bio 98:503, 1975) 
and hybridized with human Auroral and Aurora2 -specif c 
probes as described for Northern Blot analysis above. 

DNAs were restricted with Hindi I I since both 
AURi and ADR2 cDNAs contain a single site for this 



wo 97/22702 



PCT/US96/18859 



66 



restriction enzyme. AURl showed a single 4.3 kb band of 
equal intensity from all sources suggesting it is a 
single copy, non- rearranged gene in the multiple tumor 
types assayed. However, under low stringency 
conditions, we were able to detect 1.3 kb and 3.2 kb 
5 Sad fragments which weakly hybridize to the AXJRl probe. 
Cloning and sequence analysis reveals this region to 
encode an intronless AURl -related pseudogene (termed 
AUR3), with multiple frame shifts. Furthermore, 
immediately upstream of the AUR3 pseudogene is a region 
10 with complex inverted repeats predicted to form a very 
stable hairpin loop. AUR3 DNA sequence is homologous to 
AURl beginning from the first nucleotide of the AURl 
cDNA. immediately upstream of this site, is the 
predicted hairpin loop of AUR3 . We are currently 
15 characterizing AURl genomic clones to determine if the 
homology to AUR3 continues upstream from this 
nucleotide, and whether the AURl cDNA includes or be 
preceded by a similar hairpin loop. AUR2 showed bands 
at 7.0 kb and 4.3 kb and a faint higher molecular weight 
band at ~10 kb from all sources. These data suggest 
AUR2 is also a single copy gene. The multiple bands 
with seen on blots probed with AUR2 are likely due to 
the fact that a full length cDNA probe was used. 



20 



25 



30 



^Xft ^T.Tg 5. fiBOTTEMCE W^r.^f^rs QP «DNA CLONES ENCQPIWg 

The con5)lete sequence of human Auroral and 
Aurora2 was determined from full length clones of each 
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isolated from the human pancreatic carcinoma library, 
from normal human duodenum, and of the partial human 
Auroral isolated from HEPM cells. 

The 1,244 bp human Auroral (AURl_h) nucleotide 
sequence is shown in SEQ ID NO:l or SEQ ID NO: 2 and 
5 contains a single open reading frame encoding a 

polypeptide of 344 amino acids. The AURi_h coding 
region is flanked by a 54 nucleotide 5 ' -untranslated 
region and a 132 nucleotide 3 • -untranslated region 
ending with a poly (A) tail. 
0 The 2,198 bp human Aurora2 (AUR2_h) nucleotide 

sequence is shown in SEQ ID N0:1 or SEQ ID NO: 2 and 
contains a single open reading frame encoding a 
polypeptide of 403 amino acids. The AUR2_h coding 
region is flanked by a 200 nucleotide 5 • -untranslated 
5 region and a 768 nucleotide 3 ' -untranslated region. 

The sequences ADRl and AUR2 cDNAs were 
sequenced from both a human pancreatic tumor and normal 
human duodenum, with no sequence differences excepting 
some likely polymorphic sites. These ambiguities 
) include : 
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cDNA nucleotide Comment 

AURl 1174 one clone has poly A inserted 

973 T in all duodenal clones, C in pancreatic 

tumor 

469 T in one clone, C in all others 

848 G in one clone, A in all others - changes 

cunino acid E to G 

1097 G in one clone, T in 2 others 

956 G in one clone, A in 4 others 

29 Splice to 103 in 5 clones, no splice (as sovTn 

in 5 clones 



AnR2 



349 



369 



T in 1 cline, C in multiple others (change 
amino acid P to L) 

A in 3 clones, G in multiple others (change 
AA V to I) 



The C- terminal portions AURl and AUR2 conserve 
all 12 subdomains characteristic of eukaryotic protein 
kinases. This AURl and AUR2 kinase domains is preceded 
by a N- terminal domain are 74 and 13 0 amino acids, 
respectively. Comparison of the AURl and AUR2 
nucleotide and deduced amino acid sequences (SEQ ID NO: 3 
or SEQ ID NO: 4) with the available DNA and protein 
sequence databases indicated that they are unique with 
the exception of several EST sequences sharing high 
sequence identity. They do however have striking 
homology in both the N-terminal and catalytic domains 
with the drosophila aurora and SaccharomycGS cerevisiae 
IPLl genes. Furthermore two unpublished database 
entries are likely to be close homologues from Xenopus 
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laevis (P4 6APK - GB accession #Z17206 and p46BPK - GB 
accession #217207) . 

The N- terminal domains of Auroras from human, 
frog, Drosophila and yeast share limited sequence 
identity. The requiring AUR2 has an abundance of 
glutamates often present in pairs separated by a single 
residue . 

Comparison of the catalytic domains of these 
protein reveals AURl shares 70% amino acid identity with 
AUR2. 61% with the Drosophila aurora, and 45% with the 
yeast IPLI gene. AUR2 kinase shares 60% amino acid 
identity with the Drosophila protein and 45% identity to 
yeast IPLl. Both AURl and ADR2 share less than 45% 
homology with all other known mammalian kinases (the 
closest being cAMP- dependent protein kinase A) 
suggesting they are homologues of these drosophila and 
yeast kinases. 

ADRl and AUR2 both contain a cAMP- dependent 
protein kinase phosphorylation site (THR232 of AURl and 
THR288 of AUR2) that is conserved in the drosophila and 
yeast homologues and is a kno%m regulatory site in the 
cylclin-dependent kinase p34cdc2. AUR2 contains an 
additional PKA-site at SER342. Both proteins also have 
multiple Casein kinase II (five and six for AURl and 
AUR2) and protein kinase C (four and ten for AURl and 
ADR2) phosphorylation sites. AUR2 also has a single 
tyrosine phosphorylation consensus site at TYR334 that 
is also conserved with the Drosophila aurora, but is not 
present in AURl or yeast IPLl . 

Intriguingly, natural mutants of the 
drosophila aurora AUR_dm and yeast IPLl gene result in 



WO97/22702 PCT/US96/18859 



70 



asymmetric nuclear division leading to chromosome 
missegregation, and atypical, monopolar spindles. This 
phenotype appears to result from a failure of centrosome 
separation. The associated microtubule architecture 
appear unaffected. Natural mutants in both drosophila 
5 and yeast target amino acid residues that are strictly 
conserved between the human Auroras, further supporting 
they may be functional homologues. The corresponding 
residues in AURl that are found in natural mutants of 
ADR_dm or IPLl are GLU125, THR232, PR0312, HIS324 . All 
10 of these mutations are within the catalytic domain, and 
notably, one represents the conserved 
PKA-phosphorylation site. An additional mutation in 
AUR_dm at ASP47 is at a non- conserved residue in the 
N-terminal domain. 

These findings suggest the catalytic activity 
may indeed play a central role in the biology of 
centrosome replication or segregation in lower 
organisms, and suggest that the human Auroras may play a 
complementary role in mammalian cells. 



15 



20 



FiTfftTI PT i F *^ ' RBr r> F^TWAMT RXPRESSION OF ATJRORA and 

Ey ppiaggTQM yv-rTOR cOHgTRWOTiON 

Expression constructs were generated by 
PCR-assisted mutagenesis in which the entire coding 
25 domains of Auroral and Aurora2 were tagged on their 
carboxy-terminal ends with the hemophilus influenza 
hemaglutinin (HA) epitope YPYDVPDYAS (SEQ ID N0:21) 
(Pati, 1992) . These constructs were introduced into two 
mammalian expression vectors: pLXSN (Miller, A.D. & 
Rosman, G.J., Biotechniques 7, 980-988, 1989) for the 
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generation of virus producing lines; and pRK5 for 
transient expression analysis. Inserts were designed to 
be flanked by unique BamHI and Not I sites and were 
cloned directly into pLXSN or pRK5 at the 5 '--BamHI and 
3'-NotI sites. 
5 The BamHI -Not I f ull length AURl and AUR2 

constructs were also ligated into pRS316 (Liu, H. et al. 
Genetics 132:665-673, 1992). This vector contains a 
galactose -inducible promoter in a centromeric shuttle 
vector for expression in Saccharomyces cerevisiae. 
These are to assess if the human genes can complement 
the related ten5)erature sensitive yeast IPLl mutant, 
which is closely related to AURl. In addition, fusion 
constructs containing the N-terminal domain of yeast 
IPLl fused to the C-terminal kinase domains of AURl and 
AUR2 were generated. These were produced by insertion of 
an artificial Clal site at the 5' end of the kinase 
domains of the kinases, at the conserved Asp-Asp-Phe-Glu 
sequence . 

Dominant negative AURl and AUR2 constructs 
were also made in both pLXSN and pRK5 by mutation of the 
invariant Lys (amino acid positions 106 and 162 in AURl 
and AUR2, respectively) to an Met by PGR mutagenesis. 
The constructs are termed AURIKM and AUR2KM. 
Constitutively active forms of AURl and AUR2 were 
generated by mutation of the DNA heading the encoding 
the phosphoirylation site (232 and 288) to an Asp 
resulting in AURITD and AUR2TD. 

Expression constructs in both pLXSN and pRK5 
were also made containing just the N-terminal, 
non-catalytic domain of AURl and AUR2 . These were 
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generated by PGR from the parental constructs and 
contain the N- terminal 77 amino acids of AXJRl and 132 
amino acids of AUR2. 

The entire AURl cUid AUR2 open reading frames 

(no HA- tag) excluding the initiating methionines were 
generated by PGR and ligated into pGEX vector for 
bacterial production of GST- fusion proteins for 

immunization of rabbits for antibody production. 



B y y^ ^T.g 7. mnmtmatf nw virus PROPnciNG atjrora CELL 
10 JaTSXES 

To generate high-titer virus stocks, pLXSN 
recombinant constructs containing either AURl or AUR2 
genes were transfected into an amphotropic helper cell 
line PA317 using CaClj-mediated treois feet ion. After 
15 selection on G418, the cells were plated on normal media 
without G418 (500 ug/ml) . Supernatants from resistant 
cells were used to infect the ecotropic helper cell line 
GP+E86, and cells again selected on G418 . Resistant 
cells were again taken off G418, and the supernatants 
harvested every 8-12 hours and pooled as virus stock 
(Redemann, N. , Holzmann, B., Wagner, E.F., Schlessinger , 
J., & Ullrich, A., Mol. Cell. Biol. 12, 491-498, 1992). 
Viral stock titers were typically -lO«/ml. 



20 



25 unrKMPT.B 8r pi^trovirai. infection QF NXH-3T3 CBbfaS VTITH 
aORQRAS 

NIH-3T3, and BALB/3T3 cells were grown in 100 
mm plates with DMEM (Gibco) containing 10% fetal calf 
serum (PCS) . The cells were superinfected with the AURl 
30 and AUR2 retrovirus by adding approximately 3 ml viral 
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supernatant to 15 ml culture media for approximately 24 
hours. Cells expressing the retroviral constructs were 
then selected by growth in DMEM/10% FCS supplemented 
with 500 ug/ml G418. 

BaHtfiPTiB 9; GENERATION OF AURORA- SPEgTF IC TMMTTKrOREAQEMT 

AURORA- specif ic immunoreagents were raised in 
rabbits against KLH- conjugated synthetic peptides 
corresponding to either the N- terminal region of AUR2 
(i^SAPENNPEEQLASK"^) (SEQ ID NO: 22) and 

('^PLNNTQKSKQPL^o2) ^gEQ ID NO: 23) or the N- terminus or 
N- terminal domain of human AURl (^MAQKENSYPWPYG") (SEQ 
ID NO: 24) and ("PGQKVMENSSGTP") (SEQ ID NO: 25) . 
Additional immunoreagents were generated by immunizing 
rabbits with the bacterially expressed full length AURi 
and AUR2 GST- fusion proteins. 

KKAMPIiE 3,0: TRANSIENT EXPRESSION OF AtTRnP2ip jj^ 

MAMMALIAN CELLS 

The pRK5 expression plasmids (lOug DNA/lOO mm 
plate) containing the HA- tagged AURl and AUR2 genes were 
introduced into COS and 293 cells with lipofect amine 
(Gibco BRL) . After 72 hours, the cells were harvested 
in 0.5 ml solubilization buffer (20 mM HEPES pH7.35, 150 
mM NaCl, 10% glycerol, 1% Triton X-lOO, 1 . 5 mM MgCl2, 1 
mM EGTA, 2 mM phenylmethylsulf onyl fluoride, 1 ^g/ml 
aprotinin) • Sample aliquots were resolved by SDS 
polyacjrylamide gel electrophoresis (PAGE) on 
15%acrylamide/0 • 5% bis-acrylamide gels and 
electrophoretically transferred to nitrocellulose. 
Non-specific binding was blocked by preincubating blots 
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in Blotto (phosphate buffered saline containing 5% w/v 
non-fat dried milk and 0.2% v/v nonidet P-40 (Sigma)), 
and recombinant protein was detected using a murine Mab 
to the HA decapeptide tag. Alternatively, recombinant 
protein can be detected using various AURl- or 
5 AUR2 -specific antisera. 

KyAMPLE 11: MYELIN B ASIC PROTEIN IS AN ARTIFICIAL 
fiTTBSTRATE F OR ATTRORAL AND AURORA2 KINASE 

10 Human colorectal adenocarcinoma SW480 cells 

were cultured in RPMI 1640 plus 10% fetal bovine serum, 
L-glutamine, penicillin and streptomycin. Confluent 
cultures of SW480 cells were washed three times with ice 
cold phosphate buffered saline (PBS) and then were 

15 scraped into 1 ml of ice cold PBS . The cells were 

centrifuged at 1,000 rpm at 4''C, the PBS aspirated away, 
and the resulting cell pellet stored at -SO^'C. The 
pellets from three 15 cm plates were thawed on ice and 
resuspended in a total of 1 ml of kinase lysis buffer 50 

20 mM HEPES pH 7.4, 100 mM KCl, 25 mM NaP, 1 mM NaVOa, 0.5% 
NP40, 1 raM DTT, 2 ug/ml aprotinin^ and 1 ug/ml 
leupeptin) and were rotated gently for 20 minutes at 
4**C. The samples were then centrifuged at 10,000 x g 
for 10 minutes at 4'*C and the resulting supernatant was 

25 transferred to a clean 1.5 ml centrifuge tube and stored 
or kept on ice. The protein concentration was 
determined by Bradford analysis. One milligram of total 
protein was pre-cleared with 10 fil of protein A- 
Sepharose (Boehringer) for 15 minutes at 4*'C followed by 

30 the addition of 2 fil of either rabbit pre-iiranune serum, 
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affinity purified auroral peptide antisera, affinity- 
purified auroral peptide antisera plus 6 fig of competing 
auroral peptide, affinity purified aurora2 peptide 
antisera, or, affinity purified aurora2 peptide antisera 
plus 6 ftg of competing aurora2 peptide and incubated for 
30 minutes at 4'C. Subsequently, 10 ^1 of protein A- 
sepharose was added and the incubation was continued for 
an additional 45 minutes at 4'*C. The tubes were briefly 
centrifuged to pellet the antibody-protein A-sepharose 
complex and the resulting supernatant was aspirated off. 
The antibody-protein A-sepharose pellet was washed twice 
with 0.5 ml of kinase lysis buffer followed by a wash 
with 0.5 ml of kinase buffer (20 mM HEPES pH 7.4, 125 mM 
KCl, 10 mM MgCla, 1 mM NaF, 1 mM NaVO,, and 1 mM DTT) . 
The antibody-protein A-sepharose pellet was resuspended 
in 20 nl of kinase buffer containing 5 uCi of [y-"P] ATP 
and 0.5 mg/ml myelin basic protein (Sigma), incubated 
for 20 minutes at 37*C after which 10 ^1 of protein 
sample buffer 200 mM Tris-Cl pH 6.8, 40% glycerol, 730 
mM B-mercaptoethanol, 0.4% SDS, and 0.05% Bromophenol 
Blue) was added. The tubes were mixed well and 
incubated for 5 minutes at 100 "C. The samples were 
resolved on an 18% SDS polyacrylamide gel and visualized 
by autoradiography. 
Results 

Auroral and aurora2 immunocomplexes were able 
to phosphorylate myelin basic protein. When competing 
peptide was used in the immunoprecipitations neither 
auroral nor aurora2 antsera immunocomplexes were able to 
phosphorylate myelin basic protein more than the pre- 
immune sera control. This suggests that the kinase 
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activity observed is due to auroral and 2 and not to 
other proteins present in the immunocomplex , 

This observation will allow for the 
purification of active auroral and 2 kinase by using 
myelin basic protein as a substrate to follow kinase 
5 activity. It also will allow the development of an in 
vitro kinase assay using recombinant auroral and 2 
proteins. Furthermore an auroral and 2 in vitro kinase 
assay will allow one to screen small molecule 
collections for inhibitors of the auroral and 2 kinases 
10 by measuring the inhibition of phosphorylation of myelin 
basic protein. 

The invention illustratively described herein 
suitably may be practiced in the absence of any element 

15 or elements, limitation or limitations which is not 
specifically disclosed herein. The terms and 
expressions which have been employed are used as terms 
of description and not of limitation, and there is no 
intention that in the use of such terms and expressions 

20 of excluding any equivalents of the features shown and 

described or portions thereof, but it is recognized that 
various modifications are possible within the scope of 
the invention claimed. Thus, it should be understood 
that although the present invention has been 

25 specifically disclosed by preferred embodiments and 
optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those 
skilled in the art, and that such modifications and 
variations are considered to be within the scope of this 

30 invention as defined by the appended claims. 
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Those references not previously incorporated 
herein by reference, including both patent and non- 
patent references, are expressly incorporated herein by 
reference for all purposes. Other embodiments are 
within the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Mossie, Kevin G. 

Plowman, Gregory D. 

(ii) TITLE OF INVENTION: Human Aurora Application 

5 (iii) NUMBER OF SEQXJENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Lyon & Lyon 

(B) STREET: 633 West Fifth Street, Suite 4700 
1Q (c) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 90071-2066 

(v) COMPUTER READABLE FORM: 
15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi> CURRENT APPLICATION DATA: 
20 (A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Warburg, Richard J. 
25 (B) REGISTRATION NUMBER: 32,327 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 489-1600 

(B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



30 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 base pairs 

(B) TYPE: nucleic acid 

(C) STRRNDEDNESS : single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CGGGAGAGTA GCAGTGCCTT GGACCCCAGC TCTCCTCCCC CTTTCTCTCT AAGGATGGCC 60 
5 CAGAAGGAGA ACTCCTACCC CTGGCCCTAC GGCCGACAGA CGGCTCCATC TGGCCTGAGC120 
ACCCTGCCCC AGCGAGTCCT CCGGAAAGAG CCTGTCACCC CATCTGCACT TGTCCTCATG180 
AGCCGCTCCA ATGTCCAGCC CACAGCTGCC CCTGGCCAGA AGGTGATGGA GAATAGCAGT240 
GGGACACCCG ACATCTTAAC GCGGCACTTC ACAATTGATG ACTTTGAGAT TGGGCGTCCT300 
CTGGGCAAAG GCAAGTTTGG AAACGTGTAC TTGGCTCGGG AGAAGAAAAG CCATTTCATC36 0 
10 GTGGCGCTCA AGGTCCTCTT CAAGTCCCAG ATAGAGAAGG AGGGC6TGGA GCATCAGCTG420 
CGCAGAGAGA TCGAAATCCA GGCCCACCTG CACCATCCCA ACATCCTGCG TCTCTACAAC480 
TATTTTTATG ACCGGAGGAG GATCTACTTG ATTCTAGAGT ATGCCCCCCG CGGGGAGCTC 

540 

TACAAGGAGC TGCAGAAGAG CTGCACATTT GACGA6CAGC GAACAGCCAC GATCATGGAG6 0 0 
15 GAGTTGGCAG ATGCTCTAAT GTACTGCCAT GGGAAGAAGG TGATTCACAG AGACATAAAG660 
CCAGAAAATC TGCTCTTAGG GCTCAAGGGA GAGCTGAAGA TTGCTGACTT CGGCTGGTCT72 0 
GTGCATGCGC CCTCCCTGAG GAGGAAGACA ATGTGTGGCA CCCTGGACTA CCTGCCCCCA780 
GAGATGATTG AGGGGCGCAT GCACAATGAG AAGGTGGATC TGTGGTGCAT TGGAGTGCTT84 0 
TGCTATGAGC TGCTGGTGGG GAACCCACCC TTCGAGAGTG CATCACACAA CGAGACCTAT900 
20 CGCCGCATCG TCAAGGTGGA CCTAAAGTTC CCCGCTTCTG TGCCCACGGG AGCCCAGGAC960 
CTCATCTCCA AACTGCTCAG GCATAACCCC TCGGAACGGC TGCCCCTGGC CCAGGTCTCA020 
GCCCACCCTT GGGTCCGGGC CAACTCTCGG AGGGTGCTGC CTCCCTCTGC CCTTCAATCT080 
GTCGCCTGAT GGTCCCTGTC ATTCACTCGG GTGCGTGTGT TTGTATGTCT GTGTATGTAT14 0 
AGGGGAAAGA AGGGATCCCT AACTGTTCCC TTATCTGTTT TCTACCTCCT CCTTTGTTTA200 
25 ATAAAG6CTG AAGCTTTTTG TAAAAAAACA AAAAAAAAAA AAAA 1244 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
10 GGGATATCTC AGTGGCGGAC GAGGACGGCG GGGACAAGGG GCGGCTGGTC GGAGTGGCGG 60 
ACGTCAAGTC CCCTGTCGGT TCCTCCGTCC CTGAGTGTCC TTGGCGCTGC CTTGTGCCCG120 
CCCAGCGCCT TTGCATCCGC TCCTGGGCAC CGAGGCGCCC TGTAGGATAC TGCTTGTTAC180 
TTATTACAGC TAGAGGCATC ATGGACCGAT CTAAAGAAAA CTGCATTTCA GGACCTGTTA240 
AGGCTACAGC TCCAGTTGGA GGTCCAAAAC GTGTTCTCGT GACTCAGCAA TTTCCTTGTC300 
15 AGAATCCATT ACCTGTAAAT AGTGGCCAGG CTCAGCGGGT CTTGTGTCCT TCAAATTCTT360 
CCCAGCGCGT TCCTTTGCAA GCACAAAAGC TTGTCTCCAG TCACAAGCCG GTTCAGAATC420 
AGAAGCAGAA GCAATTGCAG GCAACCAGTG TACCTCATCC TGTCTCCAGG CCACTGAATA4 8 0 
ACACCCAAAA GAGCAAGCAG CCCCTGCCAT CGGCACCTGA AAATAATCCT GAGGAGGAAC540 
TGGCATCAAA ACAGAAAAAT GAAGAATCAA AAAAGAGGCA GTGGGCTTTG GAAGACTTTG600 
20 AAATTGGTCG CCCTCTGGGT AAAGGAAAGT TTGGTAATGT TTATTTGGCA AGAGAAAAGC660 
AAAGCAAGTT TATTCTGGCT CTTAAAGTGT TATTTAAAGC TCAGCTGGAG AAAGCCGGAG720 
TGGAGCATCA GCTCAGAAGA GAAGTAGAAA TACAGTCCCA CCTTCGGCAT CCTAATATTC780 
TTAGACT6TA TGGTTATTTC CATGATGCTA CCAGAGTCTA CCTAATTCTG GAATATGCAC840 
CACTTGGAAC AGTTTATAGA GAACTTCAGA AACTTTCAAA GTTTGATGAG CAGAGAACTG900 
25 CTACTTATAT AACAGAATTG GCAAATGCCC TGTCTTACTG TCATTCGAAG AGAGTTATTC960 
ATAGAGACAT TAAGCCAGAG AACTTACTTC TTGGATCAGC TGGAGAGCTT AAAATTGCAG020 
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ATTTTGGGTG GTCAGTACAT GCTCCATCTT CCAGGAGGAC CACTCTCTGT GGCACCCTGffiOSO 

ACTACCTGCC CCCTGAAATG ATTGAAGGTC GGATGCATQA TGAGAAGGTG GATCTCTGGA14 0 

GCCTTGGAGT TCTTTGCTAT GAATTTTTAG TTGGGAAGCC TCCTTTTGAG GCAAACACAT200 

ACCAAGAGAC CTACAAAAGA ATATCACGGG TTGAATTCAC ATTCCCTGAC TTTGTAACAG260 

AGGGAGCCAG GGACCTCATT TCAAGACTGT TGAAGCATAA TCCCAGCCAG AGGCCAATG(C320 

5 TCAGAGAAGT ACTTGAACAC CCCTGGATCA CAGCAAATTC ATCAAAACCA TCAAATTGC(C380 

AAAACAAAGA ATCAGCTAGC AAACAGTCTT AGGAATCGTG CAGGGGGAGA AATCCTTGA(B440 

CrCAGGGCTGC CATATAACCT GACAGGAACA TGCTACTGAA GTTTATTTTA CCATTGACTffiS 0 0 

CTGCCCTCAA TCTAGAACGC TACACAAGAA ATATTTGTTT TACTCAGCAG GTGTGCCTTA560 

ACCTCCCTAT TCAGAAAGCT CCACATCAAT AAACATGACA CTCTGAAGTG AAAGTAGCCA620 

10 CGAGAATTGT GCTACTTATA CTGGTTCATA ATCTGGAGGC AAGGTTCGAC TGCAGCCGCffi680 

CCGTCAGCCT GTGCTAGGCA TGGTGTCTTC ACAGGAGGCA AATCCAGAGC CTGGCTGTG(B74 0 

GGAAAGTGAC CACTCTGCCC TGACCCCGAT CAGTTAAGGA GCTGTGCAAT AACCTTCCTA800 

GTACCTGAGT GAGTGTGTAA CTTATTGGGT TGGCGAAGCC TGGTAAAGCT GTTGGAATGA860 

GTATGTGATT CTTTTTAAGT ATGAAAATAA AGATATATGT ACAGACTTGT ATTTTTTCTfl:920 

15 TGGTGGCATT CCTTTAGGAA TGCTGTGTGT CTGTCCGGCA CCCCGGTAGG CCTGATTGG<B980 

TTTCTAGTCC TCCTTAACCA CTTATCTCCC ATATGAGAGT GTGAAAAATA GGAACACGTG04 0 

CTCTACCTCC ATTTAGGGAT TTGCTTGGGA TACAGAAGAG GCCATGTGTC TCAGAGCTGTIOO 

TAAGGGCTTA TrTTTTTAAA ACATTGGAGT CATAGCATGT GTGTAAACTT TAAATATGCAieO 

AATAAATAAG TATCTATGTC AAAAAAAAAA AAAAAAAA 2198 

20 (2) INFORMATICS FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 344 amino acids 
CB) TYPE: amino acid 
(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Gin Lys Glu Asn Ser Tyr Pro Trp Pro Tyr Gly Arg Gin Thr 
X 5 10 15 

Ala Pro Ser Gly Leu Ser Thr Leu Pro Gin Arg Val Leu Arg Lys Glu 
20 25 30 

5 Pro Val Thr Pro Ser Ala Leu Val Leu Met Ser Arg Ser Asn Val Gin 

35 40 45 

Pro Thr Ala Ala Pro Gly Gin Lys Val Met Glu Asn Ser Ser Gly Thr 
50 55 60 

Pro Asp He Leu Thr Arg His Phe Thr He Asp Asp Phe Glu He Gly 
10 65 70 75 80 

Arg Pro Leu Gly Lys Gly Lys Phe Gly Asn Val Tyr Leu Ala Arg Glu 
85 90 95 

Lys Lys Ser His Phe He Val Ala Leu Lys Val Leu Phe Lys Ser Gin 
100 105 110 

15 He Glu Lys Glu Gly Val Glu His Gin Leu Arg Arg Glu He Glu He 

115 120 125 

Gin Ala His Leu His His Pro Asn He Leu Arg Leu Tyr Asn Tyr Phe 
130 135 140 

Tyr Asp Arg Arg Arg He Tyr Leu He Leu Glu Tyr Ala Pro Arg Gly 
20 145 150 155 160 

Glu Leu Tyr Lys Glu Leu Gin Lys Ser Cys Thr Phe Asp Glu Gin Arg 
165 170 175 

Thr Ala Thr He Met Glu Glu Leu Ala Asp Ala Leu Met Tyr Cys His 
180 185 190 

25 Gly Lys Lys Val He His Arg Asp He Lys Pro Glu Asn Leu Leu Leu 

195 200 205 

Gly Leu Lys Gly Glu Leu Lys He Ala Asp Phe Gly Trp Ser Val His 
210 215 220 

Ala Pro Ser Leu Arg Arg Lys Thr Met Cys Gly Thr Leu Asp Tyr Leu 
30 225 230 235 240 

Pro Pro Glu Met He Glu Gly Arg Met His Asn Glu Lys Val Asp Leu 
245 250 255 
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Trp Cys lie Gly Val Leu Cys Tyr Glu Leu Leu Val Gly Asn Pro Pro 
260 26S 270 

Phe Glu Ser Ala Ser His Asn Glu Thr Tyr Arg Arg He Val Lys Val 
275 280 28S 

Asp Leu Lys Phe Pro Ala Ser Val Pro Thr Gly Ala Gin Asp Leu He 
290 295 300 



ser Lys Leu Leu Arg His Asn Pro Ser Glu Arg Leu Pro Leu Ala Gin 

320 



305 310 315 



Val Ser Ala His Pro Trp Val Arg Ala Asn Ser Arg Arg Val Leu Pro 
325 330 335 

10 Pro Ser Ala Leu Gin Ser Val Ala 

340 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I<£NGTH: 403 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
20 (iv) ANTI-SENSE: NO 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Asp Arg Ser Lys Glu Asn Cys He Ser Gly Pro Val Lys Ala Thr 
15 10 15 

Ala Pro Val Gly Gly Pro Lys Arg Val Leu Val Thr Gin Gin Phe Pro 



25 20 25 



30 



Cys Gin Asn Pro Leu Pro Val Asn Ser Gly Gin Ala Gin Arg Val Leu 
35 40 45 

Cys Pro Ser Asn Ser Ser Gin Arg Val Pro Leu Gin Ala Gin Lys Leu 
50 55 60 

Val Ser Ser His Lys Pro Val Gin Asn Gin Lys Gin Lys Gin Leu Gin 

'° 75 eo 

Ala Thr Ser Val Pro His Pro Val Ser Arg Pro Leu Asn Asn Thr Gin 
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85 90 95 

jjys Ser Lys Gin Pro Leu Pro Ser Ala Pro Glu Asn Asn Pro Glu Glu 
100 105 110 

Glu Leu Ala Ser Lys Gin Lys Asn Glu Glu Ser Lys Lys Arg Gin Trp 
115 120 125 

5 Ala Leu Glu Asp Phe Glu He Gly Arg Pro Leu Gly Lys Gly Lys Phe 

130 135 140 

Gly Asn Val Tyr Leu Ala Arg Glu Lys Gin Ser Lys Phe He Leu Ala 
145 150 155 160 



10 



Leu Lys val Leu Phe Lys Ala Gin Leu Glu Lys Ala Gly Val Glu His 
165 170 175 

Gin Leu Arg Arg Glu Val Glu He Gin Ser His Leu Arg His Pro Asn 
180 185 190 

He Leu Arg Leu Tyr Gly Tyr Phe His Asp Ala Thr Arg Val Tyr Leu 
195 200 205 

15 He Leu Glu Tyr Ala Pro Leu Gly Thr Val Tyr Arg Glu Leu Gin Lys 

210 215 220 

Leu ser Lys Phe Asp Glu Gin Arg Thr Ala Thr Tyr He Thr Glu Leu 
225 230 235 240 

20 Ala Asn Ala Leu Ser Tyr Cys His Ser Lys Arg Val He His Arg Asp 

245 250 255 

He Lys Pro Glu Asn Leu Leu Leu Gly Ser Ala Gly Glu Leu Lys He 
260 265 270 

Ala Asp Phe Gly Trp Ser Val His Ala Pro Ser Ser Arg Arg Thr Thr 
25 275 280 285 

Leu cys Gly Thr Leu Asp Tyr Leu Pro Pro Glu Met He Glu Gly Arg 
290 295 300 

Met His Asp Glu Lys Val Asp Leu Trp Ser Leu Gly Val Leu Cys Tyr 
305 310 315 320 

30 Glu Phe Leu Val Gly Lys Pro Pro Phe Glu Ala Asn Thr Tyr Gin Glu 

325 330 335 

Thr Tyr Lys Arg He Ser Arg Val Glu Phe Thr Phe Pro Asp Phe Val 
340 345 350 

Thr Glu Gly Ala Arg Asp Leu He Ser Arg Leu Leu Lys His Asn Pro 
35 355 360 365 
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Ser Gin Aarg Pro Met Leu Arg Glu Val Ijeu Glu His Pro Trp lie Thr 
370 375 380 

Ala Asn Ser Ser Lys Pro Ser Asn Cys Gin Asn hys Glu Ser Ala Ser 
385 390 395 400 

Lys Gin Ser 



BNSOOCID: <WO_G72Z702A1_L> 



wo 97/22702 



PCT/US96/18859 



86 



What is claimed is: 



1 . An isolated, enriched or purified nucleic 
acid molecule encoding AUR-1 and/or AUR-2 polypeptide. 

2 - The nucleic acid molecule of claim 1 
5 wherein said nucleic acid is human nucleic acid, 

3 . The nucleic acid molecule of claim 1 
wherein said molecule encodes at least 25 contiguous 
amino acids of the amino acid sequence shown in SEQ ID 
NO : 3 or SEQ ID NO : 4 . 

10 4 . The nucleic acid molecule of claim 1 

wherein said molecule encodes at least 50 contiguous 
amino acids of the amino acid sequence shown in SEQ ID 
NO: 3 or SEQ ID NO: 4. 

5 - The nucleic acid molecule of claim 1 
15 wherein said molecule encodes at least 100 contiguous 
amino acids of the amino acid sequence shown in SEQ ID 
NO : 3 or SEQ ID NO : 4 • 

6 . The nucleic acid molecule of claim 1 
wherein said molecule encodes at least 200 contiguous 
20 amino acids of the amino acid sequence shown in SEQ ID 
NO: 3 or SEQ ID NO: 4. 
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7. The nucleic acid molecule of claim 1 
wherein said molecule encodes at least 300 contiguous 
amino acids of the amino acid sequence shown in SEQ ID 
NO: 3 or SEQ ID NO:4. 

8. A nucleic acid probe for the detection of 
5 nucleic acid encoding an AUR-i and/or AUR-2 polypeptide 

in a sample. 

9. The probe of claim 8 wherein said 
polypeptide comprises at least 25 contiguous amino acids 
of the amino acid sequence shown in SEQ ID NO: 3 or SEQ 

10 ID NO: 4. 

10. The probe of claim 8 wherein said 
polypeptide comprises at least 50 contiguous amino acids 
of the amino acid sequence shown in SEQ ID NO: 3 or SEQ 
ID NO : 4 . 

IS 11- The probe of claim 8 wherein said 

polypeptide comprises at least 100 contiguous amino 
acids of the amino acid sequence shown in SEQ ID NO: 3 or 
SEQ ID NO: 4. 

12. The probe of claim 8 wherein said 
20 polypeptide comprises at least 200 contiguous amino 

acids of the amino acid sequence shown in SEQ ID NO: 3 or 
SEQ ID NO:4. 
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13. The probe of claim 8 wherein said 
polypeptide comprises at least 300 contiguous amino 
acids of the amino acid sequence shown in SEQ ID NO: 3 or 
SEQ ID NO: 4. 

14 • A recombinant nucleic acid molecule 
5 encoding an AUR-1 and/or AUR-2 polypeptide and a vector 
or promoter effective to initiate transcription in a 
host cell. 

15- A recombinant nucleic acid molecule 
comprising a transcriptional region functional in a 
10 cell, a sequence complementary to an RNA sequence 
encoding an AUR-1 or AUR-2 polypeptide and a 
transcriptional termination region functional in a cell. 

16. An isolated, enriched or purified AUR-1 
or AUR-2 polypeptide. 

15 17. The AUR-1 or AUR-2 polypeptide of claim 

16 wherein said polypeptide comprises at least 25 
contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 

18. The AUR-1 or AUR-2 polypeptide of claim 
20 16 wherein said polypeptide comprises at least 50 

contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 
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19. The AUR-1 or AUR-2 polypeptide of claim 
16 wherein said polypeptide comprises at least 100 
contiguous amino acids of the amino acid sequence shown 
in SEQ ID N0:3 or SEQ ID NO:4. 

20. The AUR-1 or AUR-2 polypeptide of claim 
5 16 wherein said polypeptide comprises at least 200 

contiguous amino acids of the amino acid sequence shown 
in SEQ ID N0:3 or SEQ ID NO:4. 

21. The AUR-1 or AUR-2 polypeptide of claim 
16 wherein said polypeptide comprises at least 3 00 

10 contiguous amino acids of the amino acid sequence shown 
in SEQ ID N0:3 or SEQ ID N0:4. 

22. The AUR-1 or AUR-2 polypeptide of claim 
16 wherein said polypeptide is isolated, purified, or 
enriched from a mammal . 

15 23. The AUR-1 or AUR-2 polypeptide of claim 

16 wherein said polypeptide is isolated, purified, or 
enriched from a human. 

24. The AUR-1 or AUR-2 polypeptide of claim 
16 wherein said polypeptide is an AUR-1 polypeptide. 

20 25. The AUR-1 or AUR-2 polypeptide of claim 

16 wherein said polypeptide is an AUR-2 polypeptide. 
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26. An antibody having specific binding 
affinity to an AUR-1 or AUR-2 polypeptide. 

27. The antibody of claim 26 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 3 
contiguous amino acids of the amino acid sequence shown 

5 in SEQ ID NO: 3 or SEQ ID NO:4. 

28. The antibody of claim 26 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 4 
contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 

10 29- The antibody of claim 26 wherein said 

AUR-1 or AUR-2 polypeptide comprises at least 25 
contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO: 4. 

30. The antibody of claim 26 wherein said 
15 AUR-1 or AUR-2 polypeptide comprises at least 50 

contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 

31. The antibody of claim 26 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 100 

20 contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 
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32. The antibody of claim 26 wherein said 
polypeptide is isolated, purified, or enriched from a 
mammal • 

33. The antibody of claim 26 wherein said 
polypeptide is isolated, purified, or enriched from a 

5 human . 

34 . The antibody of claim 26 wherein said 
polypeptide is an AUR-1 polypeptide. 

35. The antibody of claim 26 wherein said 
polypeptide is an AUR-2 polypeptide. 

10 36. A hybridoma which produces an euitibody 

having specific binding affinity to a AUR-1 and/or AUR-2 
polypeptide . 

37. The hybridoma of claim 36 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 25 

15 contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO: 4. 

38. The hybridoma of claim 36 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 50 
contiguous amino acids of the amino acid sequence shown 

20 in SEQ ID NO: 3 or SEQ ID NO: 4. 

39. The hybridoma of claim 36 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 100 
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contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO:3 or SEQ ID N0:4. 

40. The hybridoma of claim 36 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 200 
contiguous amino acids of the amino acid sequence shown 

5 in SEQ ID NO: 3 or SEQ ID NO: 4. 

41. The hybridoma of claim 3 6 wherein said 
AUR-1 or AUR-2 polypeptide comprises at least 300 
contiguous amino acids of the amino acid sequence shown 
in SEQ ID NO: 3 or SEQ ID NO:4. 

10 42 • The hybridoma of claim 36 wherein said 

polypeptide is isolated, purified, or enriched from a 
mammal . 



43. The hybridoma of claim 36 wherein said 
polypeptide is isolated, purified, or enriched from a 
15 human. 



44 . The hybridoma of claim 36 wherein said 
polypeptide is an ADR-1 polypeptide. 

45. The hybridoma of claim 44 wherein said 
polypeptide is an AUR-2 polypeptide. 
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